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Abstract. The question we address in this paper is the 
determination of the correlation properties of the dark 
matter halos appearing in cosmic density fields once they 
underwent a strongly nonlinear evolution induced by grav- 
itational dynamics. A series of previous works have given 
indications of what non-Gaussian features are induced by 
nonlinear evolution in term of the high-order correlation 
functions. Assuming such patterns for the matter field, 
that is that the high-order correlation functions behave 
as products of two-body correlation functions, we derive 
the correlation properties of the halos, that are assumed to 
represent the correlation properties of galaxies or clusters. 

The hierarchical pattern originally induced by gravity 
is shown to be conserved for the halos. The strength of 
their correlations at any order varies, however, but is found 
to depend only on their internal properties, namely on the 
parameter x oc m/r^ ' where m is the mass of the halo, 
r its size and 7 is the power law index of the two-body 
correlation function. This internal parameter is seen to be 
close to the depth of the internal potential well of virialized 
objects. We were able to derive the explicit form of the 
generating function of the moments of the halo counts 
probability distribution function. In particular we show 



explicitely that, generically, Sp{x) 
halo limit. 
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Various illustrations of our general results are pre- 
sented. As a function of the properties of the underly- 
ing matter field, we construct the count probabilities for 
halos and in particular discuss the halo void probability. 
We evaluate the dependence of the halo mass function on 
the environment: within clusters, hierarchical clustering 
implies the higher masses are favored. These properties 
solely arise from what is a natural bias (i.e., naturally in- 
duced by gravity) between the observed objects and the 
unseen matter field, and how it manifests itself depending 
on which selection effects are imposed. 

Key words: Cosmology: theory - large scale structure of 
the universe - galaxy: correlations - dark matter 
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1. Introduction 

One of the major pending cosmological problem for the 
formation of galaxies and clusters, their distribution and 
evolution, is the relation of the latter with the initial con- 
ditions prevailing in the early universe. To this sake, most 
of the observational constraints are relative to the lumi- 
nous part of the universe. On the other hand, theoretical 
model of primordial fluctuations are for the matter dis- 
tribution. Whether light traces matter, so that the former 
can be used to constrain the latter, has been a long debate 
over the past decade. It was clear that from the constraint 
on the small-scale peculiar velocity dispersion of galaxies 
(Davis & Peebles 1983), matter cannot trace light (Kaiser 
1984, Davis et al. 1985, Schaeffer & Silk 1985) if the den- 
sity of the universe is to be equal to the critical density. 

That galaxies might be biased with respect to the mat- 
ter field has therefore been quite popular in the past years 
especially in the frame of the Cold Dark Matter cosmolog- 
ical model. Standard CDM model provides indeed enough 
power at small scale for hierarchical clustering to occur 
but it does not not produce enough power at large scales, 
when correctly normalized at small scales, to explain the 
large scale galaxy counts (Saunders et al. 1991, Davis et 
al. 1992). This problem has been confirmed by the mea- 
surement of the very large scale cosmological fluctuations 
by Smoot et al. (1992). 

It is therefore now customary to work within the as- 
sumption of a low-density universe (with a possible cosmo- 
logical constant), that preserves the hierarchical clustering 
scenario, and with the idea that light might trace matter. 
It is then crucial to understand the possible existence of 
biases at smaller scales and their possible variation with 
scale to establish reliable constraints on the shape of the 
power initial spectrum. 

Bardeen et al. (1986) proposed a mechanism for the 
galaxies to be more correlated that the matter. It relies of 
the correlation properties of the peaks in the initial Gaus- 
sian density field. This approach was further extended by 
Mo, Jing & White (1997). The idea is that galaxies form at 
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the location of the density peaks and thus are biased from 

the beginning compared to the whole matter field. How- 
ever such a description is far from being complete since 
the subsequent complex dynamical evolution of the den- 
sity field is ignored. At scales up to 8 h~^Mpc the density 
field is completely nonlinear so that the statistical proper- 
ties of the peaks appearing in the initial Gaussian fluctu- 
ation may have been completely transformed. We present 
below arguments that take into account the nonlinear evo- 
lution of the matter field to show that the halos in such 
an evolved density field indeed are distributed differently 
than the matter. But definitely not in the way predicted 
by gaussian fluctuations. We present the complete correla- 
tion properties that are to be expected and the expression 
of the biases that appear at various levels in the nonlinear 
regime. 

The small-scale power law behavior of the galaxy cor- 
relation function is specific to hierarchical clustering sce- 
narios and arises from the nonlinear instabilities in an ex- 
panding universe. The value of the small-scale power law 
index is likely to come from the shape of the primordial 
power spectrum (Davis & Peebles 1977). Some authors, 
however, (Saslaw & Hamilton 1984) propose an explana- 
tion based on thermodynamical arguments to explain the 
emergence of such a power law behavior. In any case it 
is thought to be due of a relaxation process in the non- 
linear regime. Generalizing a relation of Peebles (1980), 
Hamilton et al. (1991) propose (see also Valageas & Scha- 
effer 1997 -hereafter VS97-) a universal transformation to 
get the evolved non-linear two-point correlation function 
from the initial spectrum, based on empirical observations 
in numerical simulations. The strength of the matter two- 
body correlation function is obviously an important infor- 
mation to derive the initial matter power spectrum, but 
the differences between linear regime and nonlinear regime 
is unlikely to be reducible solely to a transformation of the 
fluctuation spectrum. Indeed the observed three-point cor- 
relation function of the galaxies, for instance, also takes 
a specific form, as product of two two-body correlation 
functions (Groth & Peebles 1977) and can provide alterna- 
tive checks for the scenario of structure formation. These 
features cannot be predicted by analytical calculations us- 
ing the simple linear approximation with initial Gaussian 
fluctuations. This failure is not due to inapproriate initial 
conditions but to the fact that the linear approximation is 
inadequate. Perturbative calculations introducing higher 
order of the overdensity field have demonstrated that the 
gravity can induce a full hierarchy of correlations starting 
with Gaussian initial conditions (Fry 1984b, Bernardeau 
1992, 1994). 

The scaling due to hierarchical clustering can be ex- 
pressed through the behavior of the mean p-body con- 
nected correlation functions of the matter fleld within a 
volume V, ^p{v), as a function of the two-body one (see 



Balian & Schaeffer 1989 -hereafter BaS89- for a descrip- 
tion of this statistical tools). This relation can be written, 

U^) = Sp\^,iv)Y-\ (1) 

where the coefficient Sp are independent of the scale. 
When the fluctuations are small (C2(^) ~^ 0), one can 
derive the full series of the coefficients Sp. Unfortunately 
such results in the quasi-Gaussian regime are irrelevant for 
the fully nonlinear regime where numerous shell crossings 
and relaxation processes have to be taken into account. 
Explanations for the observed structures describing the 
dynamics of pressure-less particles in gravitational interac- 
tion that do not assume the existence of coherent velocity 
fiows are to be invoked. This hierarchy of equations (The 
Born, Bogolubov, Green, Kirkwood, Yvon -BBGKY- hi- 
erarchy) concerns the p-body density functions (in the full 
phase space) and has been established by Peebles (1980) 
for matter in an expanding universe. It cannot be solved 
in general, although there exist some interesting attempts 
(Hamilton 1988b, Balian & Schaeffer 1988, Hamilton et al. 
1991) but admit the so-called self-similar solutions (Davis 
& Peebles 1977). The latter solutions contain a precious 
indication since it can be shown that, in the limit of large 
fiuctuations, a similar relationship as in (1) is expected to 
occur between the mean correlation functions. Models of 
matter fiuctuations in which such a behavior is seen are 
called hierarchical models. That such solutions are rele- 
vant for the evolved density field was also not obvious 
and had been the subject of many papers. After the suc- 
cess of early models (Schaeffer 1984, 1985), a complete 
description of the properties that are to be expected for 
the counts in cell was given by BaS89 (see §2.2.1 of this 
paper for a very short review). These predictions have now 
been widely checked, for points in numerical simulations 
(see Valageas, Lacey & Schaeffer 1999) and in catalogues 
(see Benoist et al. 1998). We take now for granted that 
the hierarchy (1) holds 

For the analysis of galaxy catalogues, however, it is 
more or less assumed that the galaxy field traces the un- 
derlying density field, or at least that the properties of 
the matter should be preserved in the galaxy field. The 
question we then have to address is to confidently re- 
late properties concerning matter distributions to obser- 
vational statistical quantities such as the two and three- 
point galaxy or cluster correlation functions, their cross- 
correlations or the galaxy and cluster void probability 
functions... In the Gaussian approximation, the knowledge 
of the p-body density functions reduces to the behavior of 
the two-body function. The bias, measuring the ratio of 
the strength of the two-point correlation function of the 
galaxies to the one of the matter is then the only relevant 
parameter, as stressed previously, to quantify the depar- 
ture between mass fluctuations and would-be light fluctu- 
ations. It appears to be a crucial parameter to constrain 
the cosmological models. However, the actual departure 
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between galaxy distribution and matter distribution can- 
not be reduced to this sole parameter. More generally we 
have to search how the p-body correlation functions are 
modified for the observed galaxy distribution. 

Previous work (Bernardeau & SchaefFer 1991 -hereafter 
BeS91-, VS97, Valageas & Schaeffer 1998) suggests a nat- 
ural way to identify the visible objects among the matter 
fluctuations. It is based on the assumption that dissipativc 
processes are epiphenomena that do not modify the dis- 
tribution of matter, but just mark, with luminous stars, 
the dense matter concentrations. At scales smaller than 
the correlation length it appears (BaS89) that most of the 
volume of the universe is (nearly) empty whereas a small 
fraction of its volume contains almost all the matter. The 
latter fraction of the universe is identified with the ob- 
served luminous objects. It happens that there is a unique 
parameter that governs the mass distribution function of 
the objects. 



X 



I! 



(2) 



where m is the mass of the object, p the mean mass density 
of the universe, v the volume of the objects and ^ the mean 
value of the matter two-point correlation function within 
a cell of volume v. As ^ is seen to behave as where 
r is the size of the object and 7. the power-law index of 
the two-body correlation function is close to 1.8, x scales 
as m/r^~^ and is then nearly proportional to the internal 
velocity dispersion of these objects, (which by definition 
are supposed to be dense enough to be virialized). 

The main purpose of this paper is then to show how the 
galaxy and cluster correlations are related to the fluctua- 
tions of the matter density in the deeply nonlinear regime. 
In practice, we divide the universe into cells and calculate 
the correlation function of those cells containing a given 
amount of matter (above a rather small minimum). In 
the following we may refer to these dense cells as "halos" . 
All the calculations have been done for hierarchical mod- 
els. More specifically we introduced the tree-hierarchical 
models for which a complete description of the biases can 
be derived. These models form a large part of the hierar- 
chical models where the many-body correlation functions 
behave as a power of scale, and can be parameterized as 
products of the two-body correlation functions. They are 
described in Sect. 2. 

A calculation of the bias parameter was performed 
(Schaeffer 1985, 1987) using specific tree-hierarchical 
based models. In a previous paper Bernardeau & Scha- 
effer (1992, hereafter BeS92) performed the calculation of 
the two- and three-point correlation functions of the dense 
spots in the matter field using a much more general form of 
tree-hierarchical models. These calculations were recently 
extended by Munshi et al. (1998b) up to the 6-point func- 
tions exploiting the same techniques and models. Once 
again the scaling parameter x was seen to play a central 
role. The bias, the parameter Q3 measuring the strength 



of the three-point correlation function for halos, were both 

shown to depend only on their internal scaling parameter 
X. In Sect. 3 of this paper we present the generalization 
of these results to the correlations at any order. The com- 
plete p-body correlations of condensed objects are thus 
derived from the matter p-body correlation functions. 

Sect. 4 is devoted to the discussion of the various possi- 
ble models, still within the tree form for the matter corre- 
lations, that can be realistically (or less realistically) used 
to describe the matter field. The range of the prediction for 
the halo correlations depending on which model is adopted 
is discussed in great detail. 

In Sect. 5 we discuss various possible applications of 
these results, that may be checked cither in galaxy cata- 
logues or in numerical simulations. We consider the first 
few moments of the counts of objects, the void probability 
functions. Special care is devoted to the determination of 
the mass multiplicity function of halos in over-dense areas 
(rich clusters). 

In the last section we strike the balance of what have 
been obtained and present some possible developments. 
The mathematical aspects of the derivations are given in 
the appendices. 

Preliminary account of this work, that contains all 
the major results presented here, has been given by 
Bernardeau (PhD Thesis, Paris 1992). After near final 
completion of this paper, we learned about the overlap- 
ping work by Munshi et al. (1998b). 

2. The nonlinear behavior 

2.1. The shape of the matter correlation functions 

We assume that the correlation functions in the matter 
field follow a hierarchical pattern as in Eq. (1). More pre- 
cisely we assume that there are solutions of the equations 
of motion for the dynamical evolution of the matter dis- 
tribution, for which all many-body correlation functions 
exhibit a scaling law. 



f2 = l, ^p(Ari, . . . , Arp,/ii) = 
^^p(i'i) • • • ) ^)) 



2/3(3-7) 



AT 



(3) 



where is the connected p-point correlation function as 
a function of the p comoving coordinates ri , . . . , r^ and 
time t (Davis & Peebles 1977). The scaling laws shown in 
this equation are valid for an Einstein-de Sitter universe 
and power-law initial conditions. For the latter, in case of 
<C 1, an other similar scaling is reached in which the 
coefficient 2/3 appearing in equation (3) has to be changed 
in 1, 



^ < 1> $p(Ari,...,Arp,/it) = 
x^p(ri, . . . , Yp, t), 



3-7 



AT 



p-i 



(4) 
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which only changes the time dependence of these func- 
tions (Balian & Schacffcr 1988). The latter have argued 
that, provided either = 1 or <g; 1 so that the above 
solutions exist, they may be reached for more general ini- 
tial conditions. Numerical simulations show (Davis ct al. 
1985, Bouchet et al. 1991, Colombi et al. 1994, 1995, 1996, 
Munshi et al. 1998a, Valageas et al. 1999) that (3) is in- 
deed relevant at late times, at least for 17 = 1 and p = 2,3. 
The matter correlation functions cannot be obtained from 
observation, but the galaxy correlations for p = 2,3 also 
are seen to obey Eq. (3) or Eq. (4) (for the scale depen- 
dence). It is thus fair to assume that either Eq. (3) or 
Eq. (4) holds also for the higher p-body correlation func- 
tions, and can be used as a model to describe the matter 
fluctuations in the nonlinear regime. 

Once the two-point correlation function is assumed to 
obey (3) or (4), a further simplification can be introduced 
by assuming that can be written as a product of p — 1 
two-body correlation functions, 

ep(ri,...,rp) = Q^"Hri,..-,rp) 

trees (a) 

X E n^2(r,,r,), (5) 

labels toe links 

where (a) is a particular tree topology (Fig. 1) connecting 
the p points without making any loop, ta is a particu- 
lar labeling of the p points by the coordinates ri, . . . ,rp, 
the sum corresponding to all different possible labelings 
for the given topology (a) and the last product is made 
over the p — 1 links between the p points with two-body 
correlation functions. We note for later use that, for each 
topology (a), we can list the number of vertices having 
a given number, q, of outgoing lines, q = 1, . . . , oo . The 
weight Qp"^ is independent of time and is a homogeneous 
function of the positions, 

Q(«)(Ari, . . . , Arp) = Q(«)(ri, . . . ,rp), (6) 

associated with the order of the correlation and the topol- 
ogy involved. 

Several attempts have been made using the form (5). 
The functions Qp"^ are generally assumed to be indepen- 
dent of the coordinates ri, . . . , rp, and sometimes also, for 
simplicity, to be independent of a. For instance Schaef- 
fer (1984, 1985) uses Qp = {2P-'^/py , u = 1 ± 1, con- 
stant or moderately increasing with p, justified on obser- 
vational grounds, so as to produce enough power towards 
the small masses (power-law increase) and a slow enough 
fall-off of the induced multiplicity function, which must 
not be faster than exponential. This implies a very strong 
growth, as pi, of the p-body correlation function ampli- 
tude. Fry (1984a) proposed Qp = 1/2 (4Q/p)^"^ p/(p - 1), 
decreasing roughly like 1 /pi as a solution of the BBGKY 
hierarchy. This implies correlation functions that grow at 
most as a power of p. The associated mass function, for 




Fig. 1. An example of contribution to the 5-point correlation 
function. The term represented in the figure reads ^(ri,r2) 
U3{ri - r2,r3 - r2, r4 - r2)C(r2, r3)^(r4, r2)t'2(r2 - r4, rr, - r2) 
^(r2,r5), with the convention = 1. It has 3 vertices with 1 
outgoing line, each being weighted by f i , 1 vertex with 2 linos 
weighted by 1/2 and 1 vertex with 3 lines weighted by vs. Each 
link is weighted by a factor ^. These weights depend on the 
labels ri . . . r.s attached to the vertices. In case of the minimal 
trcc-hicrarchical model the vertices 1^2, f 3 , . ■ ■ are independent 
of the positions of the points; otherwise they are homogeneous 
functions of the lines to which they are connected. 

counts far from the mean, then differs by several orders 
of magnitude from the previous one. This result, however, 
was obtained by means of approximations that were later 
seen (Hamilton 1988b) to be unjustified, the accumulated 
error on Qp being precisely of the order of pi. The latter 

found an approximate solution where Qp"^ vanishes for all 
tree topologies, except for the particular line-topology, for 
which Qp"'' — QP~'^, indeed of moderate variation as a 
function of p since Q > 1/2, in this case, is of order unity. 

There are, however, strong observational constraints 
on these coefficients due to the fact that they determine 
the shape of the mass function of objects (Schaeffer 1984, 
1985, BaS89, BeS91, VS97). For instance the solution pro- 
posed by Hamilton (1988b) is seen to produce still too 
sharp a cut-off at high masses (although exponential, as 
desired) as well as at low masses, where the power-law be- 
havior is lacking and is not realistic enough. For this rea- 
son, BaS89 made a study valid for all models that satisfy 
a well-established set of known constraints arising either 
from mathematical consistency (such as the positivity of 
probabilities) or from the requirement that the mass func- 
tion should look like the Schechter (1976) function. 

2.1.1. The minimal tree- hierarchical model 

The model proposed by Schaeffer as well as the form pro- 
posed by Hamilton are both a particular case of a more 
general form, 

^p,minimal ~ U ^'^^ 

vertices of (a) 

where i/q is a constant weight associated to a vertex of 
the tree topology with q outgoing lines. The value of Qp"-* 
is then obtained by the product on all the weights asso- 
ciated with the topology {a}. The form (7), assumed to 
hold for the matter correlations, has been used (BeS92) 



Halo correlations in nonlinear cosmic density field 



5 



to calculate the two- and three-body correlation func- 
tions for galaxies or clusters. Wc call it here the min- 
imal tree-hierarchical model. This form is supported by 
numerical evidences for at least the bi-spectrum which is 
found to be very weakly dependent on the geometry in the 
nonlinear regime (see Scoccimarro et al. 1998 and refer- 
ences therein). This is also assumed for the construction of 
the so-called Hyperextended Perturbation Theory model 
(Scoccimarro & Frieman 1999). 

2.1.2. The general tree-hierarchical model 

But the aim of this article is to find the relation between 
the distribution of the matter and the luminous (dense) 
objects, using the most general possible form for the many- 
body correlation functions compatible with (5). The most 
general form that is relevant to the kind of calculations 
we develop in this paper is of the form 

Qi"^= n i'«(r.i-ri,...,r,,-rO, (8) 

vertices 

the product being over all vertices associated to the tree- 
topology (a); the position of the i*'' vertex is at and 

, . . . , are the coordinates of the end-points of its q 
outgoing lines. The function Uq{rsi—ri, . . . , r^^— r^) is sup- 
posed to be an homogeneous fimction of the positions. In 
general it is then supposed that the weight associated to 
the vertices may depend not only on the number of outgo- 
ing lines as in the minimal models, but also on the geome- 
try of these lines such as their relative angles, or the ratios 
of their lengths. This very general form of the correlations 
will be referred to as the tree hierarchical m,odel. 

If there is no definitive proof that the many-body cor- 
relation functions take the form (8) in the strongly non- 
linear regime, this kind of structure is however obtained 
in the quasi-linear regime (Fry 1984b, Bernardeau 1992) 
where the correlation functions follow a tree like pattern. 
In the following we will mention the properties that are 
specifically due to the hypothesis (8) once the form (3) 
has been assumed. 

2.2. Matter distribution in cells 

To calculate the number density of the dense and under- 
dense cells as a function of the mass (i.e., the number of 
matter particles) it has been shown (BaS89) that it is suf- 
ficient to know the average of the many-body correlations 
within a cell, as is exemplified by (1). We recall here these 
findings, in a language that is somewhat different from 
the one used in these earlier publications, but that will be 
similar to the one relevant to the purpose of this paper. 

Let us divide the universe in cells of given scale, or 
volume V. The counts in cells, p{m)dm, that is the prob- 
ability of finding a given amount of matter in a randomly 



chosen cell, can be obtained (White 1979, BaS89) from a 
generating function that depends only on the averages, 




This implies that the counts are defined by the knowledge 
of a set of numbers Sp, p = 3, . . . , oo (to which we add 
Si — S2 = 1 for convenience), 

Sp = ^ (10) 

which are independent of cell size and of time, provided 
(3) holds. The counts in cells are then given by (BaS89), 

p(^)dm = — ^ e^y-^{v)ri , (n) 

^ ' rnj_,^ 27ri ^ ' 

With X = — , rUc = p V ^, 
where ip{y) reads, 

^{y) = -E(-i)"^f ^- (12) 

We assume in this paper the coefhcients Sp are given. 

Within the minimal tree- hierarchical model (7), each 
of the coefficients Sp is related to the p — 1 first coefficients 
Vq (Appendix A). As a result 1/2 can then be calculated 
from 5*3, ... from Sp. In case of the minimal tree- 

hierarchical model, the matter correlation functions are 
thus entirely determined by the values of the coefficient 
Sp, and thus by the shape of the mass distribution. For 
the general tree hierarchical models, however, the count- 
in-cells are seen (A22) to depend on a suitably defined 
average over the scale-dependent vertices i/q 

iyg= J^ — l^g{v-Vu...,V- Vq) [[ — . (13) 

So, in case only the coimt-in-cells are considered, the 
tree hierarchical model and the minimal tree hierarchical 
model lead to similar, and in practice, indistinguishable 
results. However, a priori, the underlying correlations of 
the general tree hierarchical models might be quite differ- 
ent and more information than the count-in-cells is needed 
to determine all other probabilities. Nevertheless, for the 
count-in-cells, both models can be described in terms of 
constant coefficients Uq. The relation between Sp and Vq 
can be explicited through the generating function of the 
vertices Vq, 

C(r)=E(-l)%Tr' (14) 

9=0 ^" 
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from which ip{y) is easily obtained (BeS92) by summing 
(see e.g., Jannink & des Cloiseaux 1987) the so-called 
"tree-graph series" : 



<p{y) = yC{r) - 



where r is given as a function of y by the equation, 

T = -yC'{r). (16) 

The knowledge of the function ip{y) or ((t) determines the 
multiplicity of cells as a function of their matter content. 
A useful relation, that is readily derived from (15) and 
(16), is 



dy 



C(t). 



(17) 



2.3. Model-independent properties of the probability distri- 
bution 

The general properties of the cell distribution p{m,), Gaus- 
sian or non-Gaussian regime, behavior for small or large 
masses in the nonlinear regime can be established, pro- 
vided (fi (and hence C) satisfies some very general con- 
straints (BaS89). 

These properties, called "model independent" , concern 
any hierarchical model (and not only the tree models con- 
sidered here). First, (p{y) should be a power-law at large 
y, or: T respectively: 



(p{y) oc ay 



l-Ul 



0<w<l, a«l 



This implies for C,{t), at large r 



C(t) oc ct K 



2uj 
1 - w 



>0, cf«l. 



(18) 



(19) 



Secondly, the series (12) must have a very small radius 
of convergence, equal to l/x, ~ 0.1. This may be due to 
a rapid divergence of C(r) but more likely because of a 
singularity brought by the implicit equation (16) at y = 
ys = — 1/a;,, which must thus be quite close to the origin. 
This calls also for a small value of the associated value r, . 
The latter situation is the one encountered in the quasi- 
Gaussian regime (Bernardeau 1992). 

Finally, the choice (11) for defining x and rric implies, 
for small y and r, that the following expansions are valid. 



C(t) ~ 1 - r . 



(20) 
(21) 



For the dense cells, the multiplicity function in the 
non- linear regime, ^ ;:S> 1, takes the scaling form (BaS89), 



p{m) 



1 



dy 
J 27ri 



^{y)e-y, 



(22) 



that can be written, 
1 



p{m) 



rUc 



(15) provided 



1/(1-^) 



(23) 



(24) 



The latter constraint ensures that the matter content of 
the cells is large enough. We will refer to these cells as 
"halos". 

The function h{x) shows that the dependences on size, 

density and mass only enter through the ratio x = m/rUc 
(see Eqs. [1] and [11]). It is related to ip{y) by 



/ + 100 
-ioo 



dy 
27ri 



(25) 



It is defined only for x strictly positive and behaves as 
h{x) oc a(l - w)/r(w) x"-^ for < < a; < 1, (26) 
with 

1/(1-0.) 



Xy 



and 



rriv 



h{x) oc a;K-i)e-^/^» for a; » 1 



(27) 



(28) 



with, generically, for tree-hierarchical models lo^ = —3/2 . 
The mass multiplicity function thus indeed is quite similar 
to the usual Schechter (1976) parameterization (exponen- 
tial times power-law) of the galaxy and cluster luminosity 
functions. 

These predictions have been successfully checked 
against direct galaxy counts (Alimi ct al. 1990, Mauro- 
gordato et al. 1992, Benoist et al. 1998). Numerical sim- 
ulations also stongly back this description of the scaling 
properties induced by hierarchical clustering (Bouchet et 
al. 1991, Colombi et al. 1994, 1995). Some departures from 
the scaling law however were found by Colombi et al. 1996, 
whereas Munshi et al. 1998a find consistency with the nu- 
merical data, see discussion of this point by Valageas et 
al. 1999. 

A quite convenient form of C{''')^ that bears all the de- 
sired feature, asymptotic form at small x and exponential 
with large x,, is (BeS92) 



C(r) = (l + -)-, 

K 

which leads to 
u = 



K + 2' 

1 {k + 2)"^+^ 
k{k + 1)«+i ■ 



(29) 

(30) 
(31) 
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The choice of k; = 1.3 insures a good agreement with the 
counts in cell statistics obtained in CDM numerical sim- 
ulations (Bouchet et al. 1991b), whereas power-law ini- 
tial conditions (Colombi et al. 1996, Munshi et al. 1998b, 
Valageas et al. 1999) lead to k - 1.6,1.3,0.9, for f7 = 1 
and, respectively, a spectral index n = 0, — 1, —2. 

These forms are very close to the form proposed by 
Bernardeau (1992) derived from the equations of the dy- 
namics in the quasi-linear approximation. The latter finds 
(29) with K = 1.5 is relevant already in the early non- 
linear regime (when smoothing effects are neglected that 
is for n = —3). 

2.4- Sum rules 

The normalization properties for h{x) are worth being re- 
called. Indeed we have, 



f 

Jo 



xh{x)dx = 1 , 



reminiscent of, 



Jo 



mp(m)dm = pv . 



(32) 



(33) 



Both relations are exact sum rules, showing that whenever 
p{ni) in the previous integral is replaced by its approxi- 
mate form, (23), which is valid for the distribution of dense 
cells, there is no error in computing the total mass, which 
is thus shown to essentially be contained in these denser 
cells. The normalization property. 



f 

Jo 



p{m)dm = 1 , 



(34) 



which means the cells occupy all the volume on the uni- 
verse, however has no simple analogue for h{x). Indeed we 
have, 



Xr. 



h{x)dx 



a 



„w-l 



r(c.) 



r(^) 



(35) 



that implies that the volume fraction occupied by the 
dense cells described by h{x) is, 



J m 



p(m)dm < 



(36) 



which is small as compared to unity. This shows, indeed, 
that h{x) describes the denser cells, that contain all the 
matter but, occupying a negligible fraction of the volume, 
may be expected to be surrounded by regions with a neg- 
ligible fraction of the matter. This justifies the referrence 
to these cells as "halos". It will be the correlations of 
these dense cells that we aim to calculate in the follow- 
ing. Clearly, would we use the full expression (11) for the 



probability, which holds in all cases, even for the nearly 
empty regions with x Xy, and include the latter in the 
sum, the normalization would of course be unity, Eq. (34). 

For completeness, we recall also that the normalization 
of rric (Eq. 11) that is used to define x has been chosen so 
as to yield 



f 

Jo 



x^ h{x)dx = 1 . 



And more generally we have, 
x^ h{x)dx = Sp . 



L 



(37) 



(38) 



Important constraints can be obtained from the sum 
rule (38). Indeed, writing that. 



/■OO 

/ {x- afxP h{x)dx > 
Jo 

that is 

Sp-\.2 — 2aSp-\.\ -\- c^Sp > 



(39) 



(40) 



whatever a. For the value a = Sp+i/Sp that minimises the 
above expression, we get the constraint. 



Sp+2Sp — Sp_^_i > 0. 



(41) 



The same constraints hold if the counts are discrete, still 
in the limit where ^ is large. One simply has to use the 
exact expression (A2) of p{N) for the probability and the 
weight N{N — 1)...{N — p)p{N) instead oi xPh{x) so as to 
take advantage of the properties (BaS89) of the factorial 
moments. 

Such constraints were first derived in a slightly differ- 
ent form by Peebles (1980) for 5*3 and Fry (1986) for all 
the terms. They imply for instance, 



& > 1, 



(42) 



and also that the many-body coefficients Sp can never 
vanish, as is directly obvious from (38) since h{x) is posi- 
tive. 

There are also several mathematical forms for h{x) 
that will be useful. Integrating (25) by parts, we get, 



/+ioo 



lOO 

-fioo 



27ri dy 

dy 



2m 



e-^C(T) . 



(43) 



Using r as a variable, a second integration by parts gives, 
^""W^-Zl^e-.if), (44, 

where however the integration contour of r has to be suit- 
ably defined according to the function r(y) defined by 
(16). 
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3. Generating functions for conditional cell counts which permits the calculation of p^^\m), 



We still consider the Universe divided in small cells. The 

central problem wc solve in this paper is the calculation 
of the correlations, at any order, among cells that contain 
a specific amount of matter. 

Wc consider a large volume V that contains P cells, 
labeled by i = 1...P, of volume Vi, respectively. Our 
aim is to derive the joint mass distribution functions 
p{mi, . . . , TOp)dTOi . . . Amp, probability that the P cells 
contain respectively the masses mi .. .mp. We will then 
assume that such a joint distribution function is the joint 
density distribution functions of the halos of the density 
field. ' 

3.1. Biased two-body correlation function 

Wc assign a position to each of the cells, to which we 
associate respectively the typical masses md and m^i de- 
fined as previously. We then restrict our calculation to the 
case mvi <C mi which corresponds to the condensed ob- 
jects. The case of the joint mass distribution function for 
two cells has already been addressed in a previous paper 
(BeS92). We have 



p(rni, ri; m2, r2)dmidTO2 = p{mi)p{m2)dmidm2 
+P^^^(mi)^2(ri,r2)p^^^(m2)dmidm2. 



(45) 



When m ^ m^, so that the scaling (23) applies, the func- 
tion p(^^(m) is related to a new series of parameter Sp^\ 
that instead of being averages over a unique cell (10) are 
now defined as mixed averages within two (small) cells of 
volume vi and V2, respectively, under the constraint that 
the center ri and r2 of these two (distant) cells remain 
fixed. In general, for tree models, the S'p^^' parameters de- 
pend only on the average of the p + 1-point correlation 
function when p points are in one cell and the other in the 
second. A similar situation is encountered in the quasi- 
linear regime (Bernardeau 1996). We thus have generi- 
cally. 



/ 



f-H2(ri,r2) 



d^r'i d^r; 



^p+i(r;,...,r^,r2). 



(46) 



In the case of the hierarchical models, the s'^'^ parameters 
are independent of both the cell volume and |ri — r2 
soon as the finite cell size effect are neglected, i.e., v{ 
(resp. V2^) <C |ri — r2|) and are specific numbers related 
to the matter correlation functions. The fimction p*^^^(m) 
is then related to S^^ exactly by the same way p{m) is re- 
lated to Sp. We can as well defined the generating function 
of this new series 



as 

1/3 



p(i)(m) 



(48) 



In case of the minimal tree-hierarchical model we have 
(BeS92 and Appendix B), 



(49) 



where C is given by (14) and r is solution of the equation 
(16), so that we have the simple relation <p^^\y) = T{y). 

For the general tree-hierarchical models the situation 
is slightly more complicated and the Sp^^ coefficients de- 
pend on vertices Vq. With outgoing lines starting at r', 
one ending at r2 (far away) outside, whereas q — 1 ones 
end within the cell centered at ri, at r[ . . . r^_]^ that are 
averaged over the cell volume, in a procedure similar to 
(46), we get. 



z/W= / t.,(ri-r',...,r;_i-r',r2-r') 

f 1 

6(ri - r') 6(r;_i - r') d^r^ d^r'^.i d^r' 



Vl 



Vl Vl 



(50) 



We can define the generating function for these vertex 
weights (i^i^^ = 1), 



9=1 



c<«M=|:(-i)vu.^. 



(51) 



The function (p^^^ (y) is then related to the function (^-^^ (t) 
by the relationship. 



(52) 



This equation defines (p'^^^y) in terms of the function T{y) 
given by (16) which depends only on the statistics within 
one cell and is thus considered here to be known. The 
problem is then reduced to the construction of ('•^^{t), 
that is to calculate the averages (50). For the minimal 
tree models where Vq is constant, this is straightforward 
and the quantities i^q^'' just identify to the mere vertices 
i/g, so we have C^^''(''') = 'iC(T)/dT. 

The relation (45) together with the function ip^^\y) 
gives the correlation function between two cells, 

c / ^ P^^\rni) p(i)(m2) , , 
6("T'i,ri;m2,r2) = — r-6(ri,r2) — r- (53) 



p{mi) 



p{m2) 



which has the same spatial dependence than the matter 
with a bias factor b{m) given by, 



^(l)(y) 



(47) 



b{m) = 



pW(m) 
p(m) 



(54) 
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This function describes the departure between the matter 

correlation and the halo correlation for the two-point func- 
tion. It turns out to be a function of a; = m/wc oc m/r^~'' 
only, 



/>+i00 n^lCO 
dy^W(t/)e-V/ dy^{y)e-y, (55) 
-ioo ioo 

SO that (45) reads, 

j?(mi , ri ; 7712 , r2)dTOidTO2 = p(mi)p(m2)dTOidTO2 

[l + 5(.Ti)/;(.T2)C2(ri,r2)]. (56) 

where xi (resp. X2 ) is the internal x parameter of cell 1 
(resp. 2). 

Two m,odel independent properties (in the sense they 
do not depend on the function (^^^ (r) to be constructed in 
a specific model) result thus from hierarchical clustering 
in the non-linear regime: a factorization property, the total 
bias being a product of two factors that refer to the in- 
ternal properties of each cell, and a scaling property, these 
factors depending only on the internal scaling parameter 
X of each cell. 

The observational consequences of these results have 
been widely discussed in BeS92. These predictions have 
been checked against observations. Benoist et al. (1996) 
show that the amplitude of the galaxy correlation func- 
tion is definitely dependent on their luminosity (the ex- 
istence of a luminosity dependence of the galaxy corre- 
lations showing their distribution is biased with respect 
to the matter was by no means obvious: see Hamilton 
(1988a) and Valls-Gabaud et al. (1989) for an early dis- 
cussion on this point). With the uncertainty related to the 
transformation of our x parameter into luminosity (see 
BeS91, VS98), the observed bias follows very closely the 
predicted trend. Numerical simulations also (Munshi et al. 
1999) exhibit the predicted scaling of b{x) with x as well 
as the factorization property (56). 

3.2. The bias for many-body correlations 

The above considerations can be generalized to calculate 
the correlations of an arbitrary number of cells, again un- 
der the condition that their distance is much larger than 
their size. In general, for a given number of cells the con- 
nected part of the joint mass distribution function follows 
the formation rules of the tree formalism (the demonstra- 
tions are given in detail in the appendices). Indeed the 
result is, 

^p(mi,ri;...;mp,rp) = 
^ Q^p\mi,ri;...;mp,rp)Y[^2{ri,rj), (57) 

trees(a) links 

with the same structure as the one encountered in equa- 
tion (5). The normalization parameters Q^"^ are homo- 
geneous functions of the geometry of the positions of the 



cells ri, . . . , rp, and functions of the masses mi, ... , mp 
attributed to each of the cells. For each particular tree 
connecting the P cells, Qp ^ is obtained by a product over 
the set of vertices (whose order we label by Q) associated 
to the tree (a). 



(5("^(mi,ri; . . . ;mp,rp) = 

-pj- p^'^)(mj,r^,-r,, . . . ,r^q-r») 
J-J- p{mi) 

This form of Q defines "effective" vertices u, 



i^Q(ri-r,...,rQ-r) = 



j3^'^)(m,ri-r, . . . ,rQ-r) 
p{m) 



(58) 



(59) 



The vertex associated with the cell labeled (i) depends 
on the mass m, and possibly on the geometry of the Q 
outgoing lines that appear in the tree representation of 
each particular term. 

The functions p^^\m) are related to the matter cor- 
relation function properties. More specifically these func- 
tions are generated by an series, defined by. 



>rQ -r) 



1 1 
f-ie2(ri,r)...6(rQ,r) 

d^i 



J V V V 



(60) 



In case of the minimal tree model, the parameters S'p'^'' are 
pure numbers that do not depend on the size of the cells 
{vi), neither on the geometries of the Q outgoing hues. For 
the general hierarchical models a dependence with the ge- 
ometry of these lines is expected since the matter correla- 
tion function are supposed to present such a dependence. 
The derivation of the value associated to the dressed ver- 
tices p'^'^\m) is based on the same principle as for p{m) 
and p^^\m), 



+ 100 1 

p(«)(m)dm = - ^ I -J-^(Q){y) e^y, 



dm 



iruc 



C — lOO 



27ri' 



with 

oo „ 



p=i 



(61) 



(62) 



As can be seen by comparing (8) and (58), the dressed 
Q^") can then be written as a product of vertices given by 



/-t-lOO /"-t-lOO 
dy ip^'^\y)e-y / dy ip{y)e-y. (63) 

-ioo J —ioo 

We have obviously i'q^x) = 1 and I'li^x) = 6(3;). The corre- 
lation functions of the condensed objects are thus based on 
a tree structure similar to the one of the matter. As the dis- 
tribution of the matter field is given by the "microscopic" 
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vertices Uq, that depend on position unless we use the min- 
imal tree hierarchical model, the distribution of the cells 
is given by these new "dressed" vertices vq . These vertices 
depend on the internal cell properties through the scaling 
parameter x of the cell. They do not depend on position 
because all the calculations are done in the limit where 
the cell size is much smaller than the distance between 
cells. 

In the following the relationship between uq (x) and the 
matter correlation functions will be explicited. We natu- 
rally expect that these results obtained for cells randomly 
distributed in the Universe are also valid for the counts 
of astrophysical objects. A precise calculation of the halo 
correlations would require a more complicated calculation, 
but the general properties we find here are expected to be 
valid. Some departures cannot naturally be excluded al- 
though they are expected to be small since the densest 
spots of the present universe contrast extremely well with 
the matter field. 

3.3. General properties of the galaxy and cluster correla- 
tion functions 

Before calculating the vertices vqIx), wc can already de- 
rive a set of general properties of the dressed correlation 
functions, that are independent of the specific model used 
for the matter correlation functions. They depend only on 
the assumption of an underlying tree structure of the lat- 
ter. These results are the core of what has to be retained 
from these calculations. 

First of all in the strongly nonlinear regime the cor- 
relation functions of the halos have the same scale, and 
time, dependence as the matter: 

for 1^ = 1, ^^^'°(Ari,...,Arp,/it) = 



2/3(3-7) 



p-i 



ep^'''{ri,...,vp,t), 



(64) 



and so that the hierarchical properties assumed to the 
matter field should also be present in the galaxy field, or 
in the cluster field. As a result the ratios. 



-fhalo 



-rhalo 



(5 



— hak 
2 



P-1 ' 



(65) 



are also scale independent. The fact that indeed the ob- 
servations confirm the existence of such property is an 
indication of the validity of the hierarchical hypothesis at 
the matter distribution level. It is thus a confirmation of 
the vahdity of the hypothesis (8). 

The index 7 measuring the slope of the two-point 
galaxy correlation function in the nonlinear regime has 
also to remain identical to the one of the matter field, al- 
though the normalizations are not the same (see BeS92 
for a complete discussion). As a result the pair velocity 



correlation between galaxies is expected to have the dis- 
tance dependence expected in the hypothesis of no-bias. 
This is what is indeed observed as stressed by Peebles 
(1987), and is by no means in contradiction the existence 
of a non-trivial natural bias. However it does not mean 
that the large scale (quasilinear, see Bernardeau 1996) 
and small scale (nonlinear, considered here) biases should 
be the same. A short examination of the equation (50) 
proves that the parameters involved for the values of b{x) 
are completely determined by the nonlinear regime in case 
of the small-scale bias, whereas they are given by a mix- 
ture of the small scale and large scale correlation function 
behaviors for the bias at large scale. A quantitative, if not 
qualitative, change is expected to occur when one goes 
from one regime to the other, which may lead to a vari- 
ation of b. This is not in contradiction with the previous 
remark on the index 7 since in the former case, we suppose 
to be only in the strongly nonlinear regime. 

The normalization parameters (the Sp in the equation 
[65]) that arc measurable for the galaxy or cluster fields 
have no reason to be identical to the ones of the matter 
field. As a result it would be a complete nonsense to try 
to discriminate theories with the use, for instance, of the 
skewness of the matter field obtained in numerical simu- 
lation with the one of the galaxy field. 

The strength of the correlation fimctions, so the biases 
at any level, depend only on the internal parameter, x, of 
the objects. This is a property of great interest for observa- 
tional checks. Indeed objects of different natures can have 
the same x parameter. For instance the brightest galax- 
ies and the rich clusters should have the same biases, the 
"common" galaxies should have biases comparable to ones 
of the groups (BeS92). Such features can easily be checked 
in the present or coming catalogues. 

The last property concerns the i^q (x) parameters that 
govern the correlation strength. It has just been stressed 
that they are a function of x only. We also point out 
that they describe as well the auto-corrclation functions 
as the cross-correlation functions between objects of dif- 
ferent kind or between objects and the matter field. What 
does it mean in practice? Consider for instance three 
fields of any kind, galaxy, cluster or matter. Then one 
may want to know what is the three-point density func- 
tion of these fields, i.e., what is the joint expected densi- 
ties of these three fields. The three-point density function 
n^^'(ri,r2,r3) can be written 



n(^^(ri,r2,r3) = nin2nz + ni6i^2(ri, r2)'>2n2n3 +cyc. 



+ni6i^2(ri, r2)i/2,2n26(r2, r3)&3n3 -|- eye. 



(66) 



where ni is the density of the objects of kind 1, n2 of 
kind 2 and of kind 3, h\, 62 and 63 are their respective 
bias parameters. The coefficients V2.i arc the parameters 
describing the three-point correlation function of the ob- 
jects of kind i. The possible geometric dependence In Fig. 
(2) we give a diagrammatic representation of this relation. 
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.[V][V][V] 



Fig. 2. Diagrammatic representation of the three-point halo 
density function. Each filled point represents a halo at the po- 
sition ri, r2 or ra. Each line equals the matter two-point cor- 
relation function, the points with one lino have a factor rn bi 
depending to which point i it is attached, and the points with 
two lines have the factor m V2,i- The latter is independent of the 
positions of the points in case of the minimal tree-hierarchical 
model, it is an homogeneous function of the positions in the 
general case. 

These kinds of relations based on tree constructions 
can be done at any order where the correlation properties 
of any kind of object is fully determined by the scries of 
the vertices vq. The knowledge of the vertices vq then 
give very diverse results concerning the auto-correlation 
properties as well as the cross-correlation properties of any 
kind. This may be of great interest since one can imagine 
observational tests to check such a property. The simplest 
is to check that the auto-correlation function of galaxies, 
^gg, the cross-correlation between galaxies and clusters, 
^gc, and the auto-correlation function of clusters, ^cc, ver- 
ify the relation, 



^CC ^; 



gg 



(67) 



One can also construct some relations involving the three- 
point correlation functions between galaxies and clusters 
that should be verified in catalogues. 



^ccg — 



2 Cgg 

3 3 ^gg 



(68) 



The verification of these relations is a check of the hypoth- 
esis (3) as well as of the hypothesis (8). It would also be 
the indication that the clustering properties of the astro- 
physical objects have a pure gravitational origin. 

The relation (66), in case the objects are of the same 
kind, reduces to, 

n(-')(ri,r2,r3) =n3 [l + e2^-'°(ri, rz) + eye. 

+Q3 52"'°(ri, r2) 52^^'°(r2, ra) + eye] , (69) 

where ^2'''° = 6^^2 is the measurable two-point correlation 
function and Q3 = The latter is the parameter 

describing the three-point correlation function and can be 
measured directly. Such a remark can be made at any 
order so that we are led to define effective vertex weights. 



(70) 



(note that Q3 = vi). These parameters are the only ones 
that can be measured directly in catalogues and the only 
quantity that cannot be measured directly is h. 



3.4- The halo correlation functions 

3.4.1. The general expression of the vertices 

As it has been discussed previously the correlation func- 
tions of the astrophysical objects arc completely deter- 
mined by the parameters i'q{x) (Eq. [63]). The latter are 
determined by the parameters S'^'' (Eq. [60]) that depend 
on the vertices of the matter correlation functions. They 
depend more precisely on averages over the elementary 
vertices Vq, located at r in V, with q outgoing lines, Q 
of which end up in V far from r and q — Q oi which end 
within a volume v centered at r 



d^rd^ri d^rg f d^rg+i d^r, 



V V V V 
xuq{ri -r, ...,rQ-r,rQ+i-r, 

^^(i"i,r) C(i-Q,r) C(rQ+i,r) 



^rq-r) 

e(_r,,r) 



(71) 



where q represents the total number of lines connected to 
the vertex and Q is the number of long lines (correspond- 
ing to the volume V). In the tree models Vq^Q is a param- 
eter independent of the size of the volumes v and V. In 
case of the minimal tree model, one obtains Vq^q = Uq but 
we prefer to work without such a restrictive hypothesis. 
These parameters define a function of two variables, 



C{r,9) 



q,Q 



(-1)«6'« 



(72) 



In case of the minimal tree model the function ({t, 9) is 
simply a function oi t -\- 6 and can be directly deduced 
from the knowledge of the count-in-cells statistics, that is 
the knowledge of the function ^, 



minimal 



(r,0) = C(T + e) . 



(73) 



In general, all the informations that arc required for the 
calculations of the many-body correlation function bias 
we want to do are contained in the function C,{t, 9). 

On the other hand the halo correlations are entirely 
driven by the parameters i'q{x) that, as for the matter 
field, define a generating function 



°° ffQ 
C(a;,e) = ^ (-l)%(a;)-. 

Q=0 ^' 



(74) 



As a result, the halos of the density field of scaling param- 
eter X have correlations given by the function C,{x, 6) with 
the meaning of the equations (5-8). In the appendices, 
we derive the expression of ({x, 9) from the expression of 
({t,9). The result can be expressed in terms of an auxil- 
iary function, h{x,9), through. 



ax,e) 



h{x,9) 
h{x) 



(75) 
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The function h{x,9) generalizes the function h{x) with an 
extra dependence with the parameter 6 that describes the 
environment. Wo have, 



which is analogous to (17), Eq. (76) can also be written 



h{x,e) 



+ioo 



dy 



27ri 



(76) 

(77) 
(78) 



We note here that, since Ci^^^ = 0) = C(''')) we have 
ip{y, 6 = 0) = (fi{y) and h{x, 9 = 0) = h{x) (this justifies 
the notation we adopted). 

This is the central result of this paper. It is established 
in detail in the appendix B. It is valid only provided (24) 
is satisfied so as the scaling in x holds, as we generically 
assume here for the objects we call halos. 

The expression for the vertex generating function in 
the general case, when the lower masses arc included, and 
even in case the density field is represented by discrete 
points, is given in App. B. It amounts in general to use 
^cxp(— (p(y,^?)/^) instead oi —(p{y,d) in Eq. (76), the def- 
inition of C, Eq. (75), being changed accordingly. In this 
case there is no scaling in x for the underdense cells, and 
the x-dependence is to be replaced by the dependence on 
mass m. 

The equations (75-78) explicit the relationship. 



'p{y,o) 



^(-l)Q^(Q)iy)"-, 



0Q 



(79) 



between (jj'^ the vertices Vq,Q that arc directly de- 
rived from the matter correlations as a relation between 
{y) and C(t, 9). Prom ip{y, 9), it is then possible to de- 
duce ({x, 9). The latter may be directly related to ({t, 9) 
by means of Eqs. (82) or (83). The vertex generating func- 
tion C{x,9) plays the same role for describing the statistics 
of the cells labeled by their parameter x as the generating 
function ({t) which gives the statistics of the matter field 
(11, 15). The expansion of f{y,9) in powers of 9 defines 
the dressed Sp{x) coefficients, whose equations can also 
be obtained by directly expanding (75-78) . All properties 
of the halo correlations can be derived from this equation. 
In particular it is easy to see that. 



<p^'Hy) = y^{r,0)\e=o, 



(80) 



which is the generating function of the trees with one long 
external line. 

These relations can be written in several equivalent 
forms. Prom, 



d^iy,e) 
dy 



C{r,9) 



(81) 



Hx,9)= J ^^C{r,9)e^y , 



as well as, after a second integration by parts, 
dr dar,9) 



x^h{x,9) 



27ri dr 



(82) 



(83) 



As in (44), the integration is to be made over a contour 
that is defined by the relation (78) which determines the 
function y(r, 9). 

3.4.2. The normalization properties 

Considering the function (^{x, 9) defined in equation (74) 
and given in (75-78) it is straightforward to demonstrate 
that, 



X C{x,9) h{x) dx = C{t,0)\t=o, 



which means that whatever Q we have. 



Jo 



X vqix) h{x) dx = v^ 



q=Q 



(84) 



(85) 



These normalization properties (Eqs. [84, 85]) are quite 

important and deserve attention. It means that at a given 
scale the mean correlations of the dense cells are the ones 
of the matter field, once the mean is calculated in a proper 
way: each object has to be weighted be its own internal 
scaling parameter x which is equivalent to have the prob- 
ability weighted by the mass. It is thus clearly seen that 
the dense cells represent the matter content of the uni- 
verse : when properly averaged, the dense cell correlations 
are identical to the correlations of the matter points, at 
all orders. It implies that the halos are not biased as a 
whole compared to the matter field. Nevertheless, a pop- 
ulation of particular object is always biased because there 
is a segregation in the correlation properties of the ha- 
los i.e., vq{x) varies with x. It is clear from (85) that 
an a;-independent vertex uq implies i^q{x) = The 
fact that the clusters arc more correlated that the galaxies 
(Bahcall 1979) or a luminosity-dependence of the galaxy 
correlation function (Benoist et al. 1998) is an indication 
that such a segregation indeed exists in the Universe and 
implies that galaxies are not expected to be correlated as 
the matter field. 

Another sum rule involves also the underdense cells 
(App. B), and in the continuous limit (where the x- 
dependence of the vertices is to be replaced by their m- 
dependence) reads 



C(to, 9) p{m) dm = 1, 



(86) 
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which means that whatever Q we have, 



Jo 



yq (to) p{m) dm = 0. 



(87) 



So, the underdense cells must be necessarily antibiased 
{vq < 0), as a gcncrahsation of the antibiaising prop- 
erty found (BeS92) for b{m) {yi(rn) here) that was seen 
to change sign for m ~ TOv 

Similar relations can be written for a distribution of 
points, and are given in App. B. 

3.4.3. The validity limits 

The calculations that are presented in the previous section 
are valid under three conditions: 

— The spatial size of the halo has to be smaller that the 
correlation length so that the mean value of the two- 
point matter correlation function in the volume occu- 
pied by the object is larger than unity. 

— The space dependence of the correlation functions is 
valid when the size of the halos is smaller than their 
(mean) relative distances. More precisely the calcula- 
tions are made to leading order in the limit of vanish- 
ing values of ^iy)IS,{vi) where £,{vi) is the mean value 
of the matter correlation function within the object i 
and $,{V) is its mean value at a scale at which we want 
to derive the halo correlations. For instance if galax- 
ies are assumed to have a size of 100/i~^kpc including 
their dark halo, at the l/i~^Mpc scale the corrective 
terms corresponding to finite cell size effects for their 
correlation functions are expected to be of the order of 
1%. 

— The scale dependence in x is valid when the internal 
mass exceeds a mass threshold my{vi) depending on 
the size of the object. At the 100/i~^kpc scale we have 

\ 3-t-7w/(l— w) 



R 



TO.(i?)=2 10^M,^^^^^^j ,(88) 
so that galaxies are far above that limit. 



3. 5. The rare halo correlation functions 

The only parameters that can be directly measured in the 
catalogues are the parameters i>Q {x) as defined in equation 
(70). Their generating function. 



ax,e) = i-e + u2^- 

is related to Q{x, 6) by 

~ax,6) = ax,e/b{x)). 



^3 



0^ 

3! 



(89) 



(90) 



The main analytical result that can be derived from the 
equation (75-78) concerns the large x behavior. With the 
sole assumption that ({t, 0) is a regular function that does 



not contain any spurious singularity for the useful^ values 
of r, we obtain a general result that is valid for any tree- 
hierarchical model (Appendix C), 



h{x) (X x' 



10 



b{x) cx X 

^q(^) ~^ 1 whatever Q and Sp{x) pP-^ 

Cix,0) ~ e-^*(^' 

ax,9) ~ e-', 



(91) 



when a; ^ 1. Note that this limit concerns fully nonlin- 
ear objects but is valid for any correlation scale (linear 
or nonlinear). This result has been presented in an early 
work (Bernardeau, PhD thesis, Paris 1992), and is sug- 
gested by the results obtained by Munshi et al. (1998b). 
It is relevant a priori for describing the correlation of very 
bright galaxies. Indeed, the observations called immedi- 
ately at the time these concepts were discovered for such 
a modeling of the distribution of bright galaxies: the ly = 
model of Schaeffer (1984, 1985) for the galaxy many-body 
correlations is equivalent to ^qix) = 1 in the language of 
the present paper. 

3.5.1. Counting overdense cells 

We are now all set to give, as one of the possible applica- 
tions of the formulae established previously, the statistics 
of the number of cells of size v within volume V that con- 
tain a mass larger than m, (which is labeled by a scaling 
parameter larger than x). For simplicity we call these cells 
"full" cells . The probability P(> to, N) to have N of such 
full cells can be written 



P(> TO, TV) = 



1 



dA 



27ri / A^+i 



pXv(A) 



with (App. B), 

XV W = -^{>x,Y)/aV) 
$(> x,Y) = ¥({> x,e) - lYe^{> x,6) 

r = (1 - A) iVe , iVe = n(> to) Vl{V); 

where 



n(> to) = —p{> to) = -^h{> x) 

V TOc 



(92) 

(93) 
(94) 

(95) 
(96) 

(97) 



^ so that |j/s I corresponds to a singularity of implicit equation 
(78) for T rather than to a singularity of ( and hence to a finite 
value of Ts around which ( can be expanded, as is required by 
the model-independent condition. Sect. 2.3. The required con- 
dition are not fulfilled, for instance, by the model of Hamilton 
where Ts is infinite. 



14 



Halo correlations in nonlinear cosmic density field 



is the average number density of full cells. This is the 
analogue of (11) for discrete counts. For A'' large enough 
we have 



P(> m,N) = _\ H{> x,X),X= ^ 
with 



H{>x,X) = - $(>ar,y)e^^ 

J-ioo ^7^1 



(98) 



(99) 



corresponding exactly to (11). The moments of X then 
are by construction 



j dXX^ H{> x,X) = Sp{> x). 



(100) 



The positivity of iJ (> x,X) then implies the positivity of 
the Sp coefficients as well as the constraints (41) for the 
latter. 

Identical formulae can be written for halos containing 
a mass between m and m + Am, using C(x, 9) in place of 
C(> x,6) and n(m)Am in place of n(> m), with the same 
constraints (41) on Sp{x). 

4. Models for the matter correlations 

4-1- Specific models for ({t, 6) 

As can bo seen in the equation (70) the form of the 
halo correlation functions is entirely determined by C,{t, 9). 
This function is expected to obey to various rules. For the 
general case, two scales are involved. When both are in 
the nonlinear regime, one expects that, 



^qfi = ^q,q SO that C{t,0) = C(O,^)|0=r- 



(101) 



The numerical results presented in the following part 
have then been partly obtained with. 



c = 



1 + 



with K = 1.3, 



(102) 



relevant (BeS92) to the non-linear regime with CDM ini- 
tial conditions. 

However, we do not exclude that the larger scale may 
reach the linear regime. In such a case the relation (101) 
is not expected to be valid, but the parameters i>q q are 
expected to take the values found in the quasi-Gaussian 
regime from perturbative calculations (Bernardeau 1992, 
1994) that is for instance C{0,&) ~ [1 + 29/3]-^/'^ indepen- 
dently of f2 for n = — 3. 

We also introduce an other explicit model that does 
not lead to mathematical inconsistencies. This one is based 
on a property of "scale factorizability" which means that 
we assume that Uq^Q = Vq-Q yq. This can be understood 
qualitatively since we have two scales v and V different 



enough, so that the averages are in some sense indepen- 
dent. In such a case the function C,{t, 6) is factorized and 



We can then adopt the form, 



^fact.(^^ 6*) 





— K 










L n\ 




K_ 



(103) 



(104) 



In both cases the function ^(r, ^ = 0) is known to agree 
with all the constraints on h{x) (BeS92). The form (104) 
is in agreement with equation (101). 

The numerical results presented in the following have 
been obtained with these two forms. 

We will also present "model-independent" predictions, 
obtained by the mere assumption. 



C(t) cx t-'' 

for large r, with, 

< K < 00 . 



(105) 



(106) 



This leaves out the case where C(t) is a polynomial in 
T, that will be discussed below as a generalization of the 
Hamilton (1988b) model. 

4-2. The minimal tree hierarchical model 

The minimal tree model is a priori an attractive model, 

for which the geometrical dependence of the matter corre- 
lation functions are well defined and entirely determined 
by the mass function. It is thus natural to try to calculate 
the cell correlations in such a model. 

The function (^(y) for the dressed vertices in the min- 
imal tree-hierarchical model, given by using, 

t = T + 9, (107) 

as a new function depending on y and 6 through, 

t = 9-yC{t), (108) 
can be written. 



'p{y,o) 



yC{t) - ly{t 



9)Cit). 



(109) 



Successive derivatives of ip{y, 9) with respect to 9 will give 

(i)(l/) = t{y,9)\e=^ (110) 



VP«)(y) 



(111) 



Halo correlations in nonlinear cosmic density field 



15 



As an application of these results we have, 





T(y); 




(112) 




yC" 
i + yC" 




(113) 


^(3)(y) = 


vC" 




(114) 


^(^\y) = 






(115) 


{1 + yC'f 


{i+ycr 


= 




+ 10 

(i + yCr 





where the derivatives are with respect to t, now obviously 
related to y by (108) with 9 = 0, 

t = -yCit). (117) 

The results for ip'^^^ and (/j'^-* have been given in BeS92 
and the three others by Munshi et al. (1998b) from an or- 
der by order expansion of the tree summation. The general 
expression (111) obtained here allow a simple and direct 
calculation of these functions to an arbitrary order^. 

It is also possible to express h{x,9) directly as a func- 
tion of C{t) 



e <^ 



This equation can be written, 

xh{x, 9) 



^ f{y,x9) e^y 



with, 



rt{v) 

f{y,9)= / dtCit) . 
Jo 



(118) 



(119) 



(120) 



This form is well suited for an expansion in powers 
of 9 and allows one to get directly an expression for the 
vertices j^q(x). With the definition 



fQ{y) = / dt i-Citf-'^ , 

Jo 

we get vq in terms of (, 

From the latter expression, it is readily seen that the func- 
tions <p^*^Hy) can be formally expressed as 



(121) 



(122) 



fV AW 

Jo ^'^^'^ ■ 



(123) 



^W)(y) = -(- 

^ Note that, as for the general case given by Eqs. (75- 
78), these expressions are valid only provided (24) holds, as 
we assume throughout for what we call halos. In general 
(see App. B), t{y,6) in Eqs. (110-111) is to be replaced by 
t{y,e)e^p[-V>iy,e)/l]. 



4.2.1. Small x limit 

In the limit x = 0, y gets of order 1/x and t as well gets 
very large. This immediately shows that in the latter limit, 
only the model-independent asymptotic form (19) of. 



at) = ct-\ 



(124) 



will contribute. Our calculation, thus, will be valid for 
any minimal tree hierarchical model satisfying the "model- 
independent" conditions. The solution of (119) then is 

t = {Kcyr, (125) 

with 

a=l/{K + 2). (126) 
This leads to the expression of fgiy) and we get. 



^ r[-(l-a)(Q-2)] 
With the help of the relation 



+ 100 



du 



u-^ e" = l/r((5) 



27ri 



(127) 



(128) 



the dressed vertices in the small x limit can then be ex- 
pressed as 



vq{x) 



I [2(1 - a)] 



r[-(l-a)(0-2)] 
We see that 

1 [1 — aj 

which is the bias calculated in BeS92. Also 
U2{x) =b^{x)Q3{x) = , 



(129) 



(130) 



(131) 



that vanishes to leading order in x and is given by higher 
terms than the one arising from the model-independent 
form (124), as discussed by the latter authors. This vi- 
olates the lower bound (42) on Q3 (Peebles 1980), now 
applied to the moments of (100) that are directly related 
to VQix), and implies that some of the values of the as- 
sociated probabilities are negative in the small x limit. 
This problem cannot be cured by including the terms, Eq. 
(B46), that arise for a; ^ a;^ since it is always possible 
to choose the volume v small enough so as to have Xy 
arbitrarily small. 

Note however that the generating function of. 



h{> x,9) 



h{x,9), 



is ip{y,9)/y, so the corresponding vertices are 



^q{> a;) = 



2r[2(l-a)] 



(2-Q)r[(l-a)(2-Q)]- 



(132) 



(133) 
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The corresponding Sp parameters, the first few are plotted 
on Figs 4-7, arc all finite at small x and do not violate the 
positivity constraints (41). 

In this case it is actually possible to get the generating 
function for the vertices, 



C(a;>0,^) = 

r+ioo , 



-k/{k.+2) v+^ey 



(k + 1)/(k + 2) 



with 



/3 = - 



r/~dyy-«/(«+2)e2' 



'[-1/(2 + /^)] 

r [k/2 + k] 



so that C{x > 0,e) ^ 1 ~ e. When 9 
asymptotic behavior given by, 



(134) 



(135) 



00 it has an 



C>>o,^) 

x6i-2/(«+i). 



r[K/(2 + K)] 



K+ir[(K-l)/(K+l)] 



^-2/(«+l) 



(136) 



It also corresponds to moments Sp{x) that satisfy the con- 
straints (41). 

We show in appendix that for all "model-independent" 
formulations (that is all models satisfying a series of 
needed constraints related to the count-in-ccUs, see BaS89 
and discussion in Sect. 2.3), some of the higher moments 
of the cell correlations become unduly negative at small 
X (sec Figs. 5-7). This indicates clearly that such a model 
for the matter distribution is mathematically unsafe. Since 
there is no problem for the larger values of a;, we prefer to 
say that the extension of the minimal model to calculate 
the cell correlations is insufficient^ to describe the distri- 
bution of the less dense among the over-dense cells. The 
form found for b{x) and <S'3(a;) for x >, .t* are attractive 
enough to explore more completely the consequences of 
such a model on the whole galaxy clustering properties. 

4-3. The Hamilton model and extensions 

Hamilton (1988b) proposed another form for the ver- 
tex generating function of the minimal tree-hierarchical 
model, that does not enter into the class that we have 
called "model-independent" . The latter obey conditions to 
be necessarily fulfilled so as to reproduce the known prop- 
erties of the galaxy and matter count-in-cell distribution. 
We have however seen in the previous section that lead to 
difficulties for the minimal model. The question then may 
be raised whether by relax;ing the model-independent con- 
ditions, probability laws that are acceptable for all values 
of X can be found. 



^ It is due to the vanishing of the leading order of the ex- 
pansion at small x. But the calculation of the higher orders of 
perturbation theory may actually not be necessary: small de- 
viations from the minimal model also may be sufficient to cure 
the problem. Such studies, however, are left for future work. 



4.3.1. Count-in-cells 

The vertex generating function of Hamilton (1988b), in its 
original form, is. 



(137) 



with arbitrary A: > . The solution of the equation (16) 
relating the r to the y parameter is. 



T = 



y 



l + ky' 

and ip{y) is, according to (15), 
1 + (fc - 1/2)2/ 



v{y) = y- 



l + ky 



(138) 



(139) 



Prom the requirement for ip{y) to be positive for all y > 
(see discussion in BaS89), and specifically here for large 
y, we get the constraint. 



A; > 1/2 . 



(140) 



The corresponding function h{x) (for x strictly positive) 
is then (Eq. 25), 



h{x) 



2k^ 



-x/k 



(141) 



It is readily seen that, unless k = 1/2, this form does not 
satisfy the normalization condition (32) and (37). Never- 
theless, the form (141) is perfectly suited to our discussion 
for k > 1/2. Simply there is only a fraction 1/2 A; of the 
matter in the dense cells, the remainder being in the un- 
derdense cells, whereas in the generic case the latter is 
negligible. The form (141), because of its restriction to 
a; > amounts to write fiy), 



'fiiy) 



1 



y 



2kl + ky 



k-l/2^ 



-y, 



(142) 



and to note that the last term does not contribute to h{x). 

If, however, we want to exclude scaling function with 
such peculiarities, we are restricted to. 



fc = 1/2 , 



(143) 



a case that was specially disciisscd by Hamilton since it 
allows one to get many-body correlations with a structure 
that repeats itself under the grouping of a subset of points 
within an infinitesimal cell. In the case (142), the scaling 
function is, 



h{x) = 4e" 



(144) 



This is an important qualitative difference. In the ac- 
tual counts, as discussed in BaS89 and confirmed by ob- 
servations (Benoist et al. 1998 and references therein) as 
well as simulations (Valageas et al. 1999 and references 
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therein), h{x)dx diverges for x — > 0, which means that 
the number of small objects increases without limit, so 
that their total number is not defined: this means that in 
realistic models the total number of objects depends on 
other parameters than the ones introduced for the count- 
in-cells. For the models considered here on the other hand 
Jq°° h{x)dx is finite and the total number of objects is de- 
termined by the same physics that the dense cell. So, the 
models considered in this section are too different from 
these realistic models to be used as a modeling of the 
matter distribution. 

4.3.2. The halo correlations 

The equation (78) with the form (137) for ^ has the solu- 
tion, 



^ l + ky ' 
and (77) yields, 



(145) 



(146) 



The last, constant, term does not contribute to the integral 
(76) that defines (^{x, 6) and we have, 

C{x,e) ^l~2ke + k^0^ . (147) 

So, the model (137) corresponds to a constant bias, 

b{x) = 2k , (148) 

normalized to unity since xh{x) is normalized to l/2k. 
Replacing 9 by 0/2k, we get the vertex generating function 
for the dense cells. 



C{x,e) = i-e + -^e'' 



(149) 



This is nothing but the k = 1/2 model. So, for an initial 
generating function with k = 1/2, there is no bias for 
the distribution of the halos. Hamilton's condition for the 
correlation functions to repeat themselves when changing 
scale is equivalent to have an unbiased distribution. For 
k > 1/2, the distribution of cells of size v at the larger 
scale V goes over to a fc = 1/2 distribution. 



If we transform these relations into equalities, for s = 1, 

c = 1, this is the Hamilton model. 

The equation (16) for ip{y) has, in the vicinity oit = 
1/k, the solution. 



1 — fct ~ {yck) 



-l/s 



(152) 



So, the behavior of at f = 1/k is related to the behavior 
at large y of ip. that, according to (15), has the expansion. 



^(y)-yC(^) + 2^-^^(2/cfc)-vv 

This yields the behavior of h{x) at small x, 



(153) 



h{x) 



„2-l/s 



sfc2+iAr(2 + l/s)* 



(154) 



Then with the usual definition (75-78) of y for the 
correlated cells and the definition (107) for t the equation 
(77) reads. 



t-0 = -yC'{t) 

and has the solution, 

yck 



This yields the expansion, at large y, 

^{y,0)^yC{T) + 



(155) 



(156) 



'k' 2fc2 



s (1 - key+y^ 

s + 1 A;2 
We then get for h{x, 6) 



{yck) 



-l/s 



(157) 



h{x,e) 



sA:2+i/«r(2 + l/.s)' 



(1 - key+y^ . (158) 



The vertex generating function for the cell correlations, in 
the limit x = is then 



4.3.3. Other forms of in the minimal tree-hierarchical 
model 

Let us consider here the general case where ('{t) has a 
zero for finite values of t 



C{t) ~ -c (1 - kty, s > . 

Around t = 1/k, has the expansion 



at) ~ ai/k) 



{s + l)k 



(1 - kt) 



(150) 



(151) 



= 0,e) = {l- k0)^+'^/' . (159) 
Its derivative has a zero of order l/sat^ = l/A; 



89 



{x = 0,9) = {l-key/' . 



(160) 



Clearly none of these forms enter in the "model- 
independent" category defined in Sect. 2.3 since h{x) has 
not the required behavior at the origin. 
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4-4- The factorized tree-hierarchical model 



In this paragraph we explore the consequences of the hy- 
pothesis (103) on the generating function C{T,d). Wc will 
see that this "scale-factoraziblity" hypothesis will provide 
us with a phenomenological model that has all the desired j^Q|.g ^jg^ ^j^g^^ 
properties. ' 

With the assumption (103), from (75-78), it is readily 
seen that, 



The first of these equations implies an a;-dependent bias 
that is expressed in terms of the scaling function h{x) as 



. t] 

b{> x) = - 



h{> x) dx 



d , , , xhix) 



h{> x)' 



(176) 



(177) 



C(r) - -^tC{t) 



^{y,0)=ym 

with, 

T = -yamr) . 

This implies the simple result, 
We thus get, 

and 

h{> x,6) =h > 
It leads to. 



a>x,e) = 



h{> x) 



> 



m 

x 

W) 



A systematic expansion in powers of 9 gives 

^^''(y) = i^sy^'iy) + 3//2?yV"(?y) + ?/V"'(2/); 

ip^^\y) = v^y^\y) + (3//^ + 4t.3)yV"(y) 

+6i/22/V"'(y) + yV""(y) , 



and 

v\(x)h(x) 
U2{x)h{x) 
V3{x)h{x) 



dx 



xh{x); 



-^'2-r-a;/i(a;) + -^x^hix); 
dx dx-' 

d d^ 
-i^3—xh{x) + 3iy2-^x'^h{x) 



(161) 4.4.1. Large x behavior 

For large x, from (28), the bias is 



(162) 

(163) 

(164) 



(173) 



(174) 



V4(x)h(x) = —Vi—xhix) + (3i/| + 4f3 
da; 

d^ _ d^ 



dx 



2 a; h{x) 



Qv2-r^x^h{x) + -—x^h(x) . 
dx'^ dx^ 



X X 

b{x) = cOs and 6(> x) = + 1, 

Xa. 



(178) 



with gcnerically Ws = —3/2. With the form (178), it is 
readily seen from (167) that 



C{x,e) = c 



b{x) 



~ e 



(179) 



We have the same limiting form for (,{> x, 



4.4.2. Small x limit 
At a; = 0, we have, 
(165) ^(a;^o,^)=C'-'^(^) 

which, with h{x) ~ x"~'^, gives 
6(a;^0) = 1-w, 

C{x = o,e) = c'- 



(166) 
(167) 



(168) 
(169) we have 







l-LO 



(180) 

(181) 
(182) 



The same results are obtained for b{x > 0) and ({x > 0, 9). 
Finally for 



C(t)= 1 + 



^(^x = o^e) = c{x>o,e)= i + - 



(170) 

(171) with 

K = (1 — Uj)k. 

(172) 4.4.3 

. A very special case 
Interestingly, for a; ^ or for a; ^ oo, 

CiT,9)^e-^-\ 



(183) 

(184) 

(185) 



(186) 



is invariant under the transformation of the correlations 
of elementary points into the cell correlations. In this case 
we have indeed, 



((x = 0,6) = C{x ^oo,9)= e- 



(187) 



Note however that this form is not preserved for any in- 

(175) 

termediate value of x. 
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4.5. Hierarchical models and tree-hierarchical models, 
comments 

The results presented throughout this paper give the qual- 
itative and quantitative behavior of the halo correlations 
in the evolved nonlinear density field. They are based on 
assumptions on the nonlinear matter correlation hmctions 
that have been now widely checked. Nevertheless in this 
section we aim to discuss the various results that we ob- 
tain with regards to the hypothesis that have been made 
at several stages. 

The shape of the nonlinear mass distribution function 
is derived from the assumption that the correlation func- 
tions follow a hierarchical pattern (10). This form implies 
that the mass distribution function takes the form (23). 

The joint mass distribution function for two cells is 
determined by the behavior of the correlation functions 
when the points separate into two different subsets that 
lie in two volumes vi and V2 at the positions ri and T2. 
For any hierarchical clustering we obtain 

?P1,P2(^1''^2) = 

Cp.,P. [^2ivi)r~' 6(ri,r2) [Uv2)r~\ (188) 
which implies that 

p(mi,ri;m2r2) dmi dm2 =p(mi) dmi p(m2) dm2 

l + 6(ri,r2)C(^,^)l. (189) 

The resulting correlation between the halos at the posi- 
tion ri and r2 exhibits some of the general properties pre- 
viously mentioned: 

— the distance dependence is the same as the one of the 
matter correlation function; 

— the bias parameter is a function of x only. 

But at this level of hypothesis there is no property of 
factorizability. This latter property is obtained with the 
tree hypothesis which implies that Cp^ = Cp^ Cp^ . As a 
result the bias function can be factorized since, 

/.'(.r,.2)= / ^^e--+-- 

/ ZTTl 27ri 



Y^i-ir^^^Cp,,: 



\Pl >P2 



Pi! P2! 



(190) 



which under the tree-hierarchical hypothesis reads 
/.'(.r,..) = /^ (B-ir^.f) 

X / St f ) 



(191) 



Such remarks can be generalized to any order of the halo 
correlations. As a result we get the following properties: 
The hierarchical hypothesis implies that 



— the halos are correlated in a hierarchical way (Eq. [65]) 
with the same scale dependence as the matter correla- 
tion functions; 

— the strength of the correlation functions, whatever the 
order, is a function of the scaling parameter x = m/rric 
only 

The tree-hierarchical hypothesis implies that 

— the halo correlations follow a tree-hierarchical pattern 
too; 

— they take a specific form (Eq. [91]) in the very massive 
halo limit. 

Note that in case of the minimal tree model, the halo 
correlations also follow a minimal tree-hierarchical model. 



5. The observational consequences 

5.1. The two- and three-point correlation functions 




0.001 O.OiO 0.100 1.000 10.000 
x/x, 

10 1 




0.001 



10.000 



Fig. 3. Bias factor as a function of the scaling parameter x. 
The solid lines correspond to the model given by the equa- 
tion (104) and the dashed lines corresponds to a minimal 
tree- hierarchical model (Eq. [102]). In both cases we have used 
K = 1.3 for describing the matter field. In the top panel we rep- 
resent the bias as a function of x (Eq. [55]) and in the bottom 
one for a threshold in the density field. 



The results obtained for the two-point correlation func- 
tions of the halos have already been discussed on an ob- 
servational point of view in a previous paper (BeS92). 
In that paper wc only gave results for the minimal tree- 
hierarchical model. In this paper we introduce an other 
specific model (Eq. [104]), assuming the small and large 
scale-dependence of the generating function factorize, for 
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0.001 0.010 0.100 1.000 10.000 

x/x. 

Fig. 4. The three-point parameter as a function of x. The 

parameter S3 = 3 Q3 describes the strength of the three-point 
halo correlation function (Eq. [69]). The solid line (respectively 
dashed line) gives 5*3(0:) for the minimal tree (respectively 
factorized) model, the dotted line (respectively dotted-dashed 
line) gives S3{> x). 
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Fig. 5. The same as in Fig. 4 for 84. 




0.001 0.010 0.100 1.000 10.000 

x/x. 

Fig. 6. The same as in Fig. 4 for S5. 




0.001 0.010 0.100 1.000 10.000 

x/x. 

Fig. 7. The same as in Fig. 4 for Se- 

which wc calculate the shape of the bias parameter and of 
the higher order statistical properties. 

In Fig. 3 we present the function b{x) for the two mod- 
els. We also present the results for b{> x) that give the 
statistical properties of the objects defined by a threshold. 
The difference between the two cases is an indication of the 
magnitude of the variations that may occur within the hy- 
pothesis of hierarchical clustering. In both cases the bias 
is proportional to x for large values of x, and is weakly 
varying or nearly constant for small values of x. These 
results are natural consequences of the hierarchical clus- 
tering and by no mean have been imposed a priori. The 
general properties of these quantities have been given in 
(BeS92). Two properties of observational interest (Eqs. 
[67-68]) are given in Sect. 2.3. 

Although we have not made any attempt here to trans- 
form the scaling parameter x which caracterises the halos 
into luminosity, it is very obvious from both the Fig. 5a 
of BeS92 and Fig. 5 of Benoist et al. (1996) that, within 
the uncertainty in defining the luminosity, we can bring 
our predictions of the bias is in agreement with the data 
(but it is also obvious from these figures that the pro- 
cess looks somewhat easier for the factorized model, e.g. 
Fig. 4.7 of Bernardeau 1992, PhD thesis). Numerical sim- 
ulations (Munshi et al. 1999) seem to favor the minimal 
model. 



5.2. High- order correlation functions 

The new results that have been derived in this paper con- 
cern the high order correlation functions. One can obvi- 
ously calculate order by order the shape of the 4, 5-point 
correlation functions of the halos. 

It requires the expansion of the generating function 
({x, 9) with 9 (that can be done at the level of the equa- 
tions [75-78]). The coefficients of this expansion, v\{x), 
i'2{x), 1^3(0;)... give the values of Sp{x) (Eq. [65]) for 
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growing values of P: 

0^2(3;) 



1.00 



Siix) = 4 
S5ix) = 5 



i^fix) 

l^4{x) 

4{x) 



12 



60 



vl{x) 
vtixY 

V3{x)v2{x) 

vl{x) 



+ 60 



ulixY 



following a tree shape construction (e.g. Bernardeau 
1992). 

The resulting values of S;\(x)-Sq{x) are presented on 
Figs. 4-7. They have been calculated for the minimal and 
the factorized models using n = 1.3. One can see that the 
values of Sp for the factorized model are always positive 
(solid lines) and above the positivity constraints. 

This is not the case for the minimal-tree models 
(dashed lines). 

5.3. The void probability function 

An alternative way to directly constrain the high order 

correlation functions of the halos is to observe their void 
probability function. This is the probability i-*(0) that a 
spherical volume V does not contain any object of a given 
kind, and it is closely related to the behavior of the high- 
order correlation functions of the objects (White 1979). In 
case of the tree-hierarchical models it is possible to relate 
P(0) to the generating function of the vertices C,{d) (BeS92 
-see also Schaeffer, 1984- and appendix B). For the objects 
of kind X we then obtain 



(192) 



where Nc is the value taken by n{x)Vb'^{x)^{V) depending 
on the values of x and V. 

The calculable properties of P{0) are then direct con- 
sequences of the properties of ({x, 9). In Fig. 8 we present 
the shape of the function — lnP(0)/nV" as a function of 
Nc for various values of x and for the two explicit mod- 
els we considered. A change between the void probability 
function in the matter field (assumed to be represented by 
a fair sample of points), and the void probability function 
of halos is seen. This is due to the existence of biases in 
the high order correlation functions, that lead to changes 
in the values of Sp from the matter field to the halo field, 
and are independent of the fact that 6 = 1 or not. For 
the two models there is a common limit for x ^ 00, cor- 
responding to Sp = P^~^, which is independent of x. In 
such a case the void probability function reads. 



n{x)V 

Oe^ = iVc. 



> 



a 



0,10 r 



0.01 L 
0.1 




1000.0 



(193) 



Fig. 8. The void probability function P(0) as a function of 
the reduced variable Nc = nVi,{V), where n is the density 
of the sample, V the volume of a randomly placed cell and 
^{V) is the mean value of the two-point correlation function as 
measured in the sample and within the volume V. The dotted 
line is the form used to describe the matter distribution. The 
solid line is the limit we obtained for the rare halo limit (Eq. 
[192]). The dashed lines corresponds to the minimal model (Eq. 
[102]) for a:: > 0. The dottcd-dashcd line corresponds to the 
factorized-tree model (Eq. [104]) for a; = (or x > since it 
gives the same result). 



The resulting form for — lnP(0)/n(x)V' corresponds to the 
model of Schaeffer (1984) with v = 0. 

The dependence of —lnP{0)/n{x)V with x is quite 
weak at this level of description and most of the bias effects 
are contain in b{x) (especially for the bright end). The 
weak dependence of Sp{x) with x is an illustration, at 
the level of the multi-point correlation functions, of this 
feature. The kind of variation that is expected is, however, 
dependent of the form assumed for C(t, 0). For the form 
(102) this is a growing function of x, for the form (104) 
a decaying. We have thus no universal answer for a set 
of typical galaxies (x « x^). However, for rich clusters or 
bright galaxies we know what should be the shape of the 
void probability function. 

The main advance brought by our models is that it can 
reconcile the measurements in catalogues and in numeri- 
cal simulations in a unique description. The properties of 
the field halos are mainly due to a threshold effect in the 
nonlinear density field (specially for the rare halo limit), at 
contrast with the matter field properties which are solely 
due to the dynamics. It is however quite remarkable to 
note that the "model-independent" requirements 2.3 for 
the count-in-cells precisely call for the properties required 
(3.5) for the above universal asymptotic behavior to hold. 
The Hamilton model, for instance, that does not satisfy 
these requirements, also does not lead to the above behav- 
ior. In this sense, the properties of the field halos definitely 
reflect the underlying matter dynamics. Simply, the con- 
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ditions we imposed already for the count-in-cells to reflect 
that dynamics also induce the above behavior. 

5.4- The mass function in clusters 



of the two fields, P{N{x), N), which is the probability 

that the volume V contains N points of matter and N(x) 
objects of scaling parameter x. The generating function of 
such joint probability distribution is defined by 



E 

N{x),N 



(194) 
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Fig. 9. Change of the mass functions of "galaxies" from the 
ones in the field (dotted lines) to the ones in rich clusters for the 
factorized-tree model (solid line) and the minimal-tree model 
(dashed line) . The mass associated to a "galaxy" is assumed to 
bo independent of the environment. The effect is solely due to 
hierarchical clustering, the fluctuations of the non-linear den- 
sity field being larger within clusters. It is seen to be less im- 
portant with the factorized-tree model (104) in agreement with 
the results (207, 208). 



5.4.1. The general formalism 

In the previous section, correlation properties of the mat- 
ter field, and correlation properties of the halos were con- 
sidered separately. But in fact the general results we get 
hold to describe the auto-correlation functions for a given 
kind of objects, as well as for the cross-correlations be- 
tween objects of different kinds. We present the results 
that follow as an illustration of what can be derived from 
this theory. 

The calculation of the mass multiplicity function of the 
halos, representing galaxies, being in a cluster involves 
the statistics of a two component field. One component 
is the matter field characterized by a density n, a two- 
point correlation function ^2(1", r') and p-body correlation 
functions following a tree-hierarchical form given by the 
vertex generating function C(^'); the second component is a 
halo field characterized by a parameter x which leads to a 
density n{x), and correlation functions given by C(a;, 9). As 
the latter function describes as well the auto-correlations 
as the cross-correlations between the density field and the 
halo field, one can compute the joint density distribution 



and its expression reads, 
X(A„A) = {K-l)n{x)Vax,e) 

-^{X,-l)n{x)V^{x,e) 

+{X-l)nVae)-^{X-l)nV^{e) 

6 = {\,-l)n{x)Vl{V)^{x,9) 



(195) 



We are interested in the expectation number of objects of 
kind x being in a volume V that contains a certain mass 

M = N p, 



Etvc) N{x)P{N{x),N) 

Em.) mix), N) ■ 



(196) 



5.4.2. The rich cluster limit 



The mass M is chosen so that the volume V is rich enough 
to represent the inner part of a cluster (N ^ Nc{V)). 
The calculation of (^N{x))j^j from (195) is quite straight- 
forward. The generating function of 'Ef^(^x^ P{N{x), N) 
is simply exp(x(A^ = 0,A)) whereas the one of X^jv(x) 
N{x)P{N{x),N) is 



?'^(A) = ^(0,A)e'^(0'^). 

The latter expression can be deduced from (195), 
V%X) = n{x)VCix,0)e'''-°'^^ 

9 = {X-l)nVaV)^{0,9). 



(197) 



(198) 



When we are the regime N ^ A^c(^), we can use the con- 
tinuous variable X = N/Nc{V) to describe the content of 
the volume V instead of the variable N. We are thus led 
to calculate. 



j+ioo ^Y({x, 

{N{x))^ = n(a;)ypciuster^r^ 



0{Y) = -Y^{0,0). 



(199) 



When X is large one can compute more precisely the pre- 
vious relation. Its result is dominated by the singularity 
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that lie in y = t/g < in the complex plane. This singu- 
larity is characterized by the equations, 



= -1/^(0, ^a). 



(200) 



The integrations in (199) are then performed around the 

singularity Y = Us which requires the expansion of <^(6) 
and of C{x,9) around 9s- We eventually get, 



dC dC 

{N{x))^ = n{x)V PcluBter -^{xjs)/ 



80 



(201) 



The mass distribution function in rich clusters h'^{x) 
normalized so that J x h'^{x) da; = 1 is then given by. 



h%x) 



{N{x)) 



X 



n{x) V ^cluster 

that reduces to. 



h{x), 



h'^{x) = h{x)^{x,0s)/^^{es). 



(202) 



(203) 



We can notice that the result is independent ofX = N/Nc, 
say the richness of the cluster. 

For any tree- hierarchical model the behavior of ({x, 0) 
has been shown to be close to ex.p[—b{x)9] with b{x) pro- 
portional to X (Eq. [91]), so that h''{x) behaves roughly as 
b{x)h{x) at least when x is large, 



h^ix) ~ b{x)h{x). 



(204) 



As a result the upper cut-off of h(x), characterized by 
X* (Eq. [28]), is shifted towards greater values of x. This 
change can be calculated as soon as a particular form for 
C(t, 9) has been chosen. 

For the factorized-tree model it is possible to give an 
explicit result. In this case, from Eq. (164) we have in the 
rich cluster limit, 



h^x) = - 



1 



h 



h' 



which, from Eq. (175), can be written. 



h^ix) 



( ^ 



, (205) 



(206) 



This is the function which is plotted on Fig. 9 with a solid 
line. 

Thus, for the factorized-tree model we get the position 
of the cut-off for }f(x), x^, 



'k + 2 

K+1 



X^ 



1.6 Xi, for K = 1.3. 



(207) 



In case of the minimal tree-hierarchical model (Eq. 
[102]), the cut-off is given by. 



K+1 



K+2 



3.3 a;* for 



1.3. 



(208) 



One can notice that the effect is similar, although some- 
what more important. 

This has important consequences. For a given cluster 
mass, the galaxy multiplicity function within the cluster 
is expected to be biased towards higher masses, irrespec- 
tive of other phenomena that can change the observed lu- 
minosity function. The M/L ratio within clusters is thus 
expected to be biased and smaller than the average in the 
Universe. 

6. Conclusion 

In this paper we address the issue of the halo correlation 
properties in a strongly non-linear cosmic density field. 
This is a dramatic extension of the results obtained by 
Bardeen et al. (1986) and Mo et al. (1997) for Gaussian 
fields. 

The difficulty is to have a reliable description of the 
matter density fields and its high order correlation prop- 
erties. Our analyses are all based on the hierarchical prop- 
erties (in the sense of Eq. [1]) which seem well established 
in numerical simulations or in the observable Universe. 

We further assume a tree decomposition of the cor- 
relation functions^ (see Szapudi & Szalay 1997, 1998 for 
another approach). These assumptions lead to two crucial 
properties : 

— factorazibility of the mass dependence of the correla- 
tion properties; 

— the bias factors and vertices all depend only on the 
internal properties of the correlated halos, through a 
unique scaling variable x. 

The central formula we derived is given by Eqs. (75-78) 

which explicit the generating function of the vertices that 
describe the halo auto-correlation function as well as halo- 
matter cross-correlations. It provides adapted mathemat- 
ical techniques and tools to explore the relation between 
the intrinsic and joint properties of the halos in a nonlin- 
ear density field. An illustration of the strength of these 
methods is proposed in the last Section. 

Generically, we find that in all relevant cases Sp = 
pP-"^ (which corresponds to vertices that are all equal 
to unity) in the rare halo limit. Also, the mass weighted 
statistics of the halo distribution reduces, when all dense 
halos are included, to the statistics of the matter field, as 
a generalisation of the properties known (BeS92, Schaeffer 
1987) for the two-body correlation function. This is an in- 
dication that the halo correlation properties, especially for 



Note that all these properties are verified at leading order 
in perturbation theory (Bernardeau 1996). 
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the higher masses, reflect the selection effects more than 

the intrinsic properties of the matter field (this behavior 
is similar to what is expected in Gaussian field, e.g. Mo et 
al. 1997). 

In the small mass limit the behavior of the correlation 
functions are more dependent on the underlying matter 
field. Robust results in all regime therefore rely on precise 
assumptions on the geometrical dependence of the correla- 
tion functions. The simplest assumption is to suppose that 
the p-point functions are exact trees in the sense that the 
vertices arc pure numbers. This corresponds to what we 
call the minimal tree-hierarchical model. In this case it is 
possible to relate the correlation properties of halos, and 
in particular to the x dependence of b{x), to their mass 
function. It leads however to mathematical inconsistencies 
(violation of positivity constraints) that we were not able 
to cure within the minimal model. 

As an alternative, we develop a purely phenomeno- 
logical description, the factorized tree-hierarchical model. 
We use this model for illustration purposes, and because 
in this case no mathematical inconsistencies are encoun- 
tered. 

Our results for the bias arc consistent with the ob- 
served luminosity dependence of the bias (see discussion 
in Bes92 and in Benoist et al 1996) although we made no 
attempt here to determine the luminosity of the halos we 
have considered. 

Our results can also be checked against numerical sim- 
ulations. Early results by Bernardcau (1996) were focused 
on the quasilinear regime but more recently Munshi ct al. 
(1999) checked in some details the properties of b{> x) in 
the light of the predictions that were made in our previous 
paper (BeS92) for the minimal model. They found that 
the qualitative and quantitative behavior of b{> x) were 
in perfect agreement to what we expected for the minimal 
tree model, which make the mathematical inconsistencies 
we found all the more puzzling. 

Obviously, further numerical investigations would be 
precious for constraining the models. 
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A. The generating function of the counts in cell 

Let us divide the universe into cells of volume v. The prob- 
ability to find particles of matter in a given cell, p{N), 
can be evaluated by means of the generating function, 

oo 

P(A) = ^A^KA^), (Al) 

JV=0 

with, 

where the integral is calculated in the complex plane 
around A = 0. The generating function ■P(A) is related 
(BaS89, White 1979) to the integrals of the generating 

functions 

V{\) = e^^^\ (A3) 

= E „i / dV... / d% X 

p=i ^' •' V 

xep(ri,...,rp). (A4) 

A.l. The minimal tree-hierarchical model 

For the moment, we assume that the p-body correla- 
tion functions take the form (7) of the minimal tree- 
hierarchical model, 

^p(ri,...,rp) = 

E E n^2(r.,r,), (A5) 

trees (a) labels ta links 

where (a) is a particular tree topology connecting p — 1 
points without making any closed loop, ta is a particular 
labeling of the point coordinates in a given topology (a), 
the last product is made over the p — 1 links between the 
p points (labeled by their coordinates r^), and Qp"'' is a 
normalization parameter associated with the order of the 
correlation and the topology (a) involved that takes the 
form, 

Q^"^= n ""o- (A6) 

vertices of (a) 

The product is made over all the vertices of the tree, and is 
defined once the topology (a) is specified, Uq is the weight 
for each vertex of the tree and depends on the number q 
of outgoing lines (see an example of graph in Fig. 1). 

As shown in detail in a previous paper (Appendix A of 
BeS92), the function x(A) can be constructed by the clas- 
sical "tree-graph" summation Jannink and des Cloiseaux 
1987)(e.(?., Jannink and des Cloiseaux 1987). Let us define, 

°° T-Q 

C(t)=^(-1)'^^,-, (A7) 

g=0 ^' 



then x(A) is given by. 



X(A) = / C[r(r)](A-l)nd3r 

J V 




Fig. 10. Schematic representation of the equations (AlO, A19). 
Each point has the weight (A — 1) n and the lines the value of 
the two-point matter correlation function. The shaded area 
represents the sum of all trees connected to its inner point. 
The function r(x) (in [a]) is the sum of all the trees connected 
to X, and can be decomposed as a lino connected to a vertex, 
connected to as many similar sums. It loads to the implicit 
equation in t(x) given in (A8). The function t(x,x') (in [b]) 
is the sum of all trees, the first line of which is x — x'. It can 
be decomposed in a similar way and it leads to the equation 
(A19). 

The function C[''"('')] represents the vertices at the 
point r with an r-dependent weight T(r) for each outgo- 
ing line. The latter represents the sum of all tree graphs 
starting at the position r. Its value is given by the implicit 
equation (AlO) which states that T(r) equals the specific 
contribution of the first line ^(r, r') multiplied by any kind 
of vertex Ug, the g — 1 remaining lines being dressed by 
as many T(r') factors. The sum over all possible vertices 
introduces the function g^[T(r')] (and not the function 
C[T(r')] since one line is treated apart) and an integra- 
tion over r' is required to finally construct T(r). The Fig. 
10a presents a diagrammatic representation of the implicit 
equation (AlO). 

An important remark that can already be made at 
this stage is that, although the matter particles outside 
the cell are strongly correlated with the ones inside, the 
generating function for the counts in a cell depends only 
on the correlations within this cell. 

The next step is to simplify considerably the equations 
(A8-A10) by means of a "mean field" approximation that 
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turned out to be extremely accurate (about 1 to 2%, see 

discussion in BcS92). Wc integrate (AlO) over r and then 
approximate T(r) by a constant r. The equation (AlO) is 
then changed in, 



with 



- f d^r d^r' 

The equation (A8) then reads, 



(All) 



(A12) 



X(A) = (A - 1) n<(r) - i(A - l)nvT^{T). (A13) 

This is the form that is used for practical calculations. 

In the limit of a continuous distribution, when n goes 
to infinity for a given mass density p, the product (1 — A) n 
remains finite. It will turn out to be convenient to define 
in this case the variable 



y = (1 - X)nv^, 



(A14) 



which remains finite. We are then led to use the function 

</'(y) , 



x(A) = -^(t/)/C, 

and the relations (A11-A13) then read, 
r = -y-{r). 



(A15) 

(A16) 
(A17) 



A. 2. The general hierarchical model 

So far we considered the minimal tree-hierarchical model, 
for which the coefficient Qp°'^ arc given by (A6). In case of 
a general tree-hierarchical model, the equations (A7-A10) 
have to be modified to take into account the dependences 
of the coefficients Qp"^ with the geometry of the vertices. 



The analogous of the equation (AlO) now reads (see 
also Fig. 10b) 



T(r,r') = (A-l)n^(r,r') 



(A19) 



X z/g(r'-r, r'-ri, . . . , r'-r,_i) r(r', ri) . . . T(r', rg_i)] . 

It can be easily checked that this equation gives (AlO) 
when the vertices are pure numbers by an integration over 
r'. The function x(A) is now given by (i^i = 1), 

X(A)= / (A-l)nd3r 



oo „ 



(-l)Q 

(A - l)nd=^rd3ri...d%^^^ 



xug{r - ri, . . . , r - Tp) T(r, ri) . . . T(r, r^). 



(A20) 



which also reduces to (A8) in case of the minimal model. 
In the general case there is still a possible "mean field" 
approximation which just assumes that r(r, r') is propor- 
tional to ^(r, r'). 



r(r,r')«|C(r,r'). 



(A21) 



We are then led to exactly the same equation (A11-A17) 
for x(A) and (p{y) with ({t) given by (A7) and the vertices 
Ug defined by the expectation values 



f dh 

Vq= 

Jv V 



z^g(r-ri,...,r- 



. (A22) 



Obviously, the validity of such an approximation de- 
pends on the strength of the geometrical dependences of 
i/g(r — ri, . . . , r — Tg). We assume here that they are weak 
enough so that the mean field approximation is valid. 

A. 3. The calculation of p{N) 

In the limit of a continuous distribution of matter we have 
the relations 



g(")(ri,...,rp)= H j/,(r,, -ri,...,r,, -r,), (A18) 



Jo 



dm e '"c^p(to) 



where the product is still over all the vertices associated 
with the tree topology (a), is the coordinate of the 
considered vertex and r^^ , . . . , r^^ are the coordinates of 
the end points of the q outgoing lines of the vertex n. The 
difference with the previous calculation is that the vertex 
now depends on the geometry of the outgoing lines. 

In this case we are led to define the function T(r,r') 
representing the sum of all tree graphs starting with the 
line that joins r and r'. The function T(r) defined previ- 
ously is simply the integral over r' of r(r, r') within the 
volume V. 



p{m) dm = J 



-t-ioo 



rric. 



dy -m+^y 
27ri 



with, 

rUc = rfi^, m = pv, , 



(A23) 
(A24) 

(A25) 



where m is the mean mass contained in a cell of volume v. 
The counts in cells are then obtained by a simple Poisson 
convolution of Eq. (A24). 

Realistic models for p{N) and hence (p{y) or ({t) have 
been widely discussed (Schaeffer 1985, BaS89, BeS92). 
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Generically ip{y) must have singularities at negative y and 

behave as at large y, with < < 1. In the 

strongly non- linear regime ^ ^ 1, (A24) can be simpli- 
fied into, 



p{m) dm = — 



dm /■+'°°dy 



/ N JUL. 

- ip(y)e'"cy. 



(A26) 



This form holds provided ^{y)/(, is small. For low values 
of m, however, y oc m^jm gets large and. 



^ ^m' ' 



may not be small enough. The form (A26) thus holds only 
provided, 



m^ = m,c/£,^ 



(A28) 



The halos referred to in this paper are those for which 
(A28) is satisfied. 

B. Joint counts in cells 

We use the same partition of the universe into cells of 

volume Vi (which are not necessarily the same) as in the 
Appendix A and consider a set of Af cells denoted by the 
index i = 1 ... A/". We aim to calculate the probability 
that A^i elementary objects arc in cell 1 at position ri, N2 
in cell 2 at position r2, ... The corresponding generating 
function is, 



JVi Nj^ \i 



Ni 



p{n,,...,Nm). 



(Bl) 



In case x(Ai, . . . , A^v") is known, p{Ni, . . . , Nj^) can be ob- 
tained through. 



dA, 



xe 



,x(Ai,...,AAr) 



(B2) 



We may ask for the probability that cells 1,...,P, 

which are assiimed to all lie with a larger finite vol- 
ume V, contain respectively Ni,. . . , Np elementary par- 
ticles, whatever the contents of the cells P + 1 to Af are. 
This amoimts to sum over Np^i, . . . , Nf^f in (Bl) with 
Ap+i, . . . , Xj\f all equal to unity. It is important to note 
that in this case all integrations over the variables within 
cells P+1 to A/" drop out: the sums simply run from i = 1 
to P. In case P = 1 we simply recover the expressions 
(Al, A2) for the counts in one cell, the solution of which 
is given by the equations (A11-A17). 

From now on we will always implicitly assume that the 
cells 1, . . . , P all lie within a volume V, much larger than 
the volume ( <C V) of the elementary cells. 



B.l. General properties of the generating function x 
B.1.1. The minimal tree-hierarchical model 

Once again the minimal tree-hierarchical model is simpler 

and deserves to be treated separately. The function x c^a-n 
be constructed in a similar way than in (A8, AlO). The 
only change is that the value of A depends on the volume 
in which the corresponding point lies. 
The equation for T(r) now reads. 



p 

(A27) T(r)=^ /(A,-l)ndV^(r,r')^[T(r')]. (B3) 



and the expression of x is, 

X{Xi,...,Xp) = Yl / (Ai-l)nd3rC[T(r)] 

P 

-^E / (^^ - l)nd'rr{r)^[T{r)]. (B4) 

Once again we can introduce a mean field approxima- 
tion. We assume that T(r) has a constant value ti within 
each volume i. The equation for ti can be obtained from 
(B3), 

= (A, - l)nvil '^{U)+Y,{Xj-l)nvjl,j ^iWi^ 



where is the mean value of ^(r, r') within the volume i 
and is its means value when r is in Vi and r' in vj. 
For convenience, we split ti into two terms Ti and 9i, 



n = {Xi-l)nVi£,i^{Ti + ei 



9, = Y^{X,-l)nv,^i,'^{T,+e,]. 



(B6) 

(B7) 



The notation Ti corresponds to graphs the first line of 

which starts and ends within the same box i, whereas 6i 
corresponds to graphs that start in the box i but immedi- 
ately join another box j 7^ i. The function x(Ai, . . . , Ap) 
is now given by, 

p 

x{Xi, . . . ,Xp) = ^{Xi - l)nviC,{Ti + Oi) 



i=l 



p 



(B8) 



i=l 



The same summation can be written in a different way 
that will turn out to be useful in the following. We can 
first focus on one cell. Let us define x(A, 6) by, 



x{x,e)^{x~i)nvC[T{x,e) + e] 

-^(A-l)n^;T(A,^)^[T(A,0)+^], 



(B9) 
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with, 



T{\,e) = {x 



l)nvav)^{r{X,e)+e). 



(BIO) 



These relations define x(A, 0) as a function of two variables 

A and 9 (and the volume of the cell v). This function is the 
sum off all connected tree graphs within the cell (with ver- 
tices only in the cell, all being connected by lines) having 
an arbitrary number of outgoing lines, each weighted by 
a factor 0. The generating function x('^i) • • • ) ^p) can be 
built up with the functions x(Ai, 9),i = 1, . . . , P once 6 is 
specified for each values of i. The basic property required 
for such a construction is that one can relate the partial 
derivative of xi^^^) relatively to 6 (for a fixed A) to the 
derivative of (, 



(Bll) 



with T{X,e) given by (BIO). 

As a result the equation (B7) can be written using the 
function x(A, 6) instead of the function 



12 ^ij i^j'^j)' 



(B12) 



and the function x(Ai, . . . , \p) now reads 



p p 
x(Ai, . . . , Ap) = ^ x(Ai, ^i) - J E ^^)- (B13) 



i=l 



The equations (B12) and (B13) are similar to the equa- 
tions (A11-A17), where C,{t) has been replaced by x(A,^?) 
as a function of 9 and where the matter have been replaced 
by cells of weight (Ai — 1) nvi. The hmction x(A, 9) is the 
sum of all diagrams within a given cell when the environ- 
ment of the cell has a weight 9. The function x('^) in the 
appendix A corresponds to the case when no constraints 
have been put in the vicinity of the cell i, that is given by 
61 = 0, so that x(A) = x(A, = 0). 

The form x(A, 9) is the central relation for the con- 
struction of the statistics of the content of a large cell V 
in terms of the elementary finite cells Vi, instead of the 
elementary points of matter as in the Appendix A. That 
will be the ground for joint count-in-cell calculations. 

B.1.2. The general tree-hierarchical model 

The case of the general tree-hierarchical models is basi- 
cally the same although technically more complicated. 

The equation for T(r,r') now reads when r' is in the 
cell j, 



T(r,r') = (A,--l)nC(r,r') 



oo P 

EE- 

q=2n = l 



P 

E 

»<,-i=i 



/ d^ri.../ d^rg_ii/,(r'-r,r'-ri,...,r'-r<;_i) 



— r(r',ri)...T(r',rg_i) 



(B14) 



(9-1) 

The expression of x has to be changed in a similar way, 

P P oo 

x(Ai,...,Ap) = E / (A,-l)nd3r + E(l-|) 

P P P 

xEE---E /(^^-IW^"-/ d^ri.../ dVi 



a! 



;(r - ri, . . . , r - r,) t{v, Vi) . . . T(r, r^). (B15) 



The extension of the mean field approximation leads to 

assume that, 



(B16) 



whore r is in the volume i and r' in the volume j. The 
value of Ti corresponding to the definition (B6) is, 



f d^r f d^r' , . 
/ / r{r,r) 



whereas. 



(B17) 



(B18) 



Our aim is to calculate (B15) with the mean field ap- 
proximation. The quantities r^j in fact depend not only 
on Ai, . . . , Ap but also on the positions of the cells in the 
volume V. In case of the minimal tree-hierarchical model 
this dependence is completely determined by the values of 
. For the general case the vertices introduce some extra 
geometrical dependences. We then are led to introduce a 
second mean field approximation which is to assume that 
Tij is just proportional to as far as the positions of the 
cells are considered. It just leads to change the equation 
(B16) in, 



r(r,r') 



when r and r' are in Vi and, 
T(r, r ) =Ti.i— , 



(B19) 



(B20) 



when r and r' are in two different cells Vi and vj. ^{V) is 
the mean value of the two-point matter correlation func- 
tion in the large volume V. The transformation of the 
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relation (B14) and (B15) introduces the geometrical aver- 
ages, 

'^<i,Q = 

V V V V J, V V 

xuq{ri -r, ...,rQ -r,rQ+i -r, ...,r, -r) 
^C(ri,r) ^(rQ,r) ^(rQ+i,r) ^(rg,r) 



(B21) 



The index q is the total number of Uncs and Q is the 
number of long lines (going from one cell to another) . As 
soon as V <^V, the averages Vq^Q are independent of the 
partition of V in small cells. These generalized vertices arc 
thus specific of the matter correlation properties. 

We then define the generating function ({t, 9) of this 
vertices 



(B22) 



For the minimal tree-hierarchical model we simply have 
aT,6)=C{T + e). (B23) 
The equation for r, and 6i now reads 



Tj = {\i-l)nvi£,i—{Ti,ei)\ 



(B24) 
(B25) 



and 



x(Ai,...,Ap) = ^(Ai - l)nUiC(Ti,6»j) 

i=l 

1 ^ dC 



P 



1 9C 



(B26) 



The expression of x(Ai,...,Ap) can be transformed as 
previously in a tree-graph summation. We define x(A, 0) 

by 



x{\e)^{X-l)nvC[T{\,6),e] 

A^x-i)nvT{\e)^[T{\e),9] 

with 

T{\e) = {\-i)nvl{v)^[T{\,e),e]- 

We get an equation similar to (Bll), 
g(A,0) = (A-l)n.g(r,^) 



(B27) 



(B28) 



(B29) 



and the function x(Ai, 
B13): 



, Ap) is the same than in (B12, 



p 



de 



(B30) 



x(Ai,...,Ap) = ^x(A.,0,)-^E^»§^(A.,ei). (B31) 



The only difference between the minimal tree model 
and the general case is that in the latter we are led, in 
order to get tractable calculations, to integrate out the 
position dependence of the P cells, and thus replace 
in (B12) by ^(V^)- The information we get from the spa- 
tial dependences of the halo correlation functions is then 
poorer in this case. Nevertheless, for a very large variety 
of apphcations (see Schaeffer 1984, 1985, 1987, and the 
extended discussion in BeS92) we can accurately use the 
averages of these correlation functions in the volume V, 
as it is shown in the next appendix. 

B.1.3. Diagrammatic representations 

The equations (B13) and (B31) are decisive results for 
joint counts in cells calculations. They deserve attention. 
The function x(Ai,...,Ap) is the sum of all connected 
tree graphs with their proper weights, as explained in the 
beginning of this section. These relations mean that such a 
summation can be performed in two steps. Indeed one can 
first compute the sum of all graphs having their vertices in 
a given cell and a given number, say Q, of outgoing lines 
(Fig 11). Let us denote Xq(A) such a sum. The generating 
function for these graphs is precisely the function x(A, 6) 
(Eqs. [B9] and [B27]) calculated in the given cell, 6 being 
the weight attached to each free line. The expansion of 
x(A, 9) in powers of 6, 



X{\0) 



E(-1)'^W(A) 
Q 



9Q_ 



(B32) 



then leads to the corresponding sums Xq(A)- Once the 
sum within an elementary cell is known, x(Ai, • • • , Ap) is 
obtained by connecting (Fig. 12) the different cells in all 
possible ways, each connection between, say cell i and j, 
having a weight for the case of the minimal tree model, 
or simply £,{V) for the general case. These connections 
build a new tree whose elementary vertices are the graphs 
already summed within one cell. 

At this point however loops between cells are not ex- 
cluded. They arise when a cell is reached again after hav- 
ing been connected to various neighboring cells. It can be 
seen in Fig. 13a that the original graph, at the matter 
point level, has no loops according to the rules of the gen- 
eral hierarchical model although the graph at the cell level 
contains one loop. 




Fig. 11. Example of a contribution to the function X3(A). This 
is the sum of all the trees within a given box with three external 
lines. These lines each have the weight 0. 



cosmic density field 



(a) (2) (b) (2) 




Fig. 13. Two examples of contribution to the sum x(-^i) -^2, A3) 
that make intervene the same number of points in each of the 
cells. The first one (a) creates a loop between the cells although 
it is a tree at the matter point level. The second one (b) is a 
tree even at the cells level. The ratio between these two con- 
tributions is of the order of ^13/^(1') (where ^(w) is the mean 
value of the two-point matter correlation function in the vol- 
ume v of a cell) . In most of the relevant cases the contributions 
with loops will then be negligible compare to the trees. 




Fig. 12. Decomposition of the sum x(Ai, ■ ■ ■ , Aj) in product of 
functions XQ(Ai)- The term that appears in the figure involves 
4 cells with the product Xi(Ai)Ci2X3(A2)f2ji63X3(A3)C3j2C34 
X3(A4)^4j3^4j4 .... The external lines are connected to other 
cells of the volume V (with the indices ji j'2, Js and ja) 

B.2. Correlations among cells 
B.2.1. General expression 

Prom the expression of the gener- 
ating function x(Ai, . . . , Ap) of the connected graph wc 
now seek for the probability p{Nx,ri; . . . ; iVp, rp) using 
the general formula (B2). An explicit calculation is possi- 
ble when one assumes that the distances between the cells 
arc large compare to their size so that we have, 

lii^liv). (B33) 

Let us first inspect the structure of xi^i: ■ ■ ■ :^p) with 
the assumption (B33), having in mind wc seek a perturba- 
tion expansion in £,ij/^{v). The function x('^i) • • • > ^p) is 
a sum of tree graphs (Fig. 13) connecting points within the 
various cell i = 1, . . . , P, each connection being weighted 



by the average ^(vi) of the correlation function within the 
cell, and points belonging to two different cells, say i and j, 
with a weight ^ij . That this is the case in general is readily 
seen from equation (B32) where the weight of an external 
line appears indeed to be proportional to whereas an 
internal line has the weight ^. The condition (B33) insures 
that a graph that contains a loop (that is coming back to 
a given cell after having visited other cells) introduces a 
factor <C 1 more than the same graph (same num- 

ber of points in each of the cells) (Fig. 13b) that does not 
contain the loop. In the limit (B33), at leadind order, the 
remaining contribution to x(^i) • • • > ^p) is the sum with 
the smallest number of external lines connecting the dif- 
ferent cells. Whence, in the limit (B33), each cell appears 
only once in each remaining graphs. Each cell (as in Fig. 
12) then appear as a dressed point (see Fig. 11) with out- 
going lines that connect this cell to other ones. 

The sum of the remaining graphs is then a sum of 
product of factors Xq(A) corresponding to different cells: 

x(Ai, . . . , Ap) = ^ C'qi,...,Qp 

tree graphs 

xXQi(Ai)...XQp(Ap), (B34) 

where Cqj^^,,,^Qp is independent of the parameters A. 

This, however is true for x(Ai) • • • , Ap) whereas (B2) 
involves cxp[x(Ai, . . . , Ap)]. Wc therefore should extend 
the above examination to the latter. While the former is 
the sum of all connected tree graphs, the latter is the 
sum of all, connected or disconnected, tree graphs where 
now the contribution of graphs in which all points are 
not linked together is also included (the same way that 
the generating function of the cumulants is given by the 
logarithm of the generating function of the moments). 

From Fig. 14a it is clear the expansion of 
exp[x(Ai, . . . , Ap)] into tree graphs, not only brings in ad- 
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(a) (2) (W (2) 




Fig. 14. Two examples of contribution to p(N-i , N2 , Ns) that 
appear when exp[x(Ai, A2, A3)] is expanded. The first diagram 
is negligible compared to a diagram where a ^ij factor has 
been replaced by a ^ factor that still connect the three boxes. 
The second diagram cannot be a priori neglected : if a 
factor is replaced by a ^ factor the three cells are not connected 
anymore. 



ditional disconnected graphs but also, as previously, loops 

between boxes. We now proceed by selecting the connected 
contributions at the cell level, that is by identifying those 
graphs in which all cells are connected together (but with 
possible disconnected parts within the cells) . These contri- 
butions may thus be built from disconnected diagrams at 
the matter level, that become connected at the cell level. 
However for the same reason as for Fig. 13a, diagrams sim- 
ilar to Fig. 14a that build up loops among the cells have 
a negligible contribution : it is always possible to change 
a factor by a ^ factor and to preserve the connection 
between the boxes. This is not the case for diagrams like 
in Fig. 14b : in such a case changing a factor by a ^ fac- 
tor breaks the connection between the cells. It is therefore 
not possible a priori to neglect the latter contributions. 

Eventually we then are left with diagrams that form 
trees between cells in which the "dressed" vertices of order 
Q corresponding to each cell are obtained by the summa- 
tion of all diagrams - connected or not within the cell- 
having Q outgoing lines. 

We know that the generating function of the con- 
nected diagrams in a cell, with outgoing lines weighted 
by 6, is x(A, ^). The generating function of all diagrams, 
connected or unconnected, is therefore cxp[x(A, 0)] . This 
point can be illustrated for the diagrams with one exter- 
nal line (e.g. on Fig. 15). The connected diagrams with 
one external line are given by Xi(A). Adding diagrams 
with no external lines amounts to multiply Xi(-^) by the 
generating function of those diagrams, exp[x(A)] and the 
fist derivative of exp[x(A, 9)] with respect to 9 is indeed 
Xi(A) exp[x(A)]. 

In a similar way than at the end of the appendix A, 
we may introduce the function (p{y, 9) defined by, 




...= 3^exp[Z ] 



Fig. 15. Contraction of the generating function of b{N)p{N). 

It involves all possible diagrams, connected or not, that have 
one external line. It can be built from the generating function 
of the connected diagrams with one external line, Xi: and the 
generating function of the diagrams with no external lines, X- 

that changes the equation (B27, B28) in 
'fi{y,0) = yar,9)-^yT^{T,9) (B36) 
^ = -y^ir,0) (B37) 

y being defined in (A14). Then in case we are only inter- 
ested in large values of A'', in such a way that, 

A > A^v = (B38) 

then x(-^)^) is of the order of 1/^ {y is finite in Eq. B35, 
see the discussion at the end of A for the case where 
(B38) is not fulfilled). So, exp[x(A,6')] « l + xi\d) so 
that only connected diagrams contribute to the dressed 
vertices. This is also the result we could have obtained di- 
rectly by writing exp[x(Ai, . . . , Ap)] ~ 1 -h x(Ai, • • • , Ap) , 
as can be inferred from the discussion at the beginning of 
this paragraph. 

In all cases, whether A' is larger than Ny or not, each 
cell appears only once in each term of the expansion of 
exp[x(Ai, . . . , Ap)] : at the cell level all diagrams are con- 
nected trees. The integration (B2) over Ai, . . . , Ap is thus 
easily performed, and amounts to replace each factor de- 
pending only on the variable Ai of one specific cell by its 
integral l/(27ri) x JdAi/Af^'^^. This simply amounts to 
replace the sum in a given cell of all graphs with external 
lines exp[x(A, 9)] by 

^(^''^ = 2^/a^^'''''' 

and to use these new generating functions as elementary 
vertices. For dense cells (with the meaning of Eq. [B38]) 
this expression can be simplified as 

^('^'^^=2^/a^^(^'^)- 

As for the matter distribution (Eq. [A7]), we can define 
the generating function of the vertices C(-^) ^) (App. A) 
by 



x{\0) = -ip{y,9)/^{v), 



(B35) 



p{N,9)=p{N)C{N,9) 



(B41) 
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recalling that p{N) = p{N,0). The expansion of Ci^,^) 
in powers of 6 defines new vertices i'q{N), 



fjQ 

C(A^,0)=^(-l)%(iV)-, 
Q ^' 



(B42) 



that replace the parameter I'q relevant for the matter dis- 
tribution. We have, for instance 



Ui{N)p{N) = b{N)p{N) 



27ri/ 



dA 



Xi(A)e' 



X(A). 



(B43) 



^2iN)p{N) = [^,(A)+x?(A)] e>^(^). (B44) 

In the continuous limit we can further make the change of 
variable A — > y (e.g. Eq. A16) and we have, 



b{m)p{m) 



-I 

rric J- 



27ri ^ 



U2{m)p{m) = - 



<p(v)/i+Vi'n/mc. 

<p(2)(y) 



X e 

1 /■+'°° 

. 27ri 



(B45) 



rric. 



X e 



(B46) 

where ip^^\y) and 1,5(^^(2/) are successive derivatives of 
i^{y, 9) with respect to 9. 

For the dense cells we have considered in this paper, 
m ^> mv, that is typically m ^ mc, these vertices, as well 
as the ones of higher order depend on mass only through 
the ratio x = m/mc- This limit will be examined in detail 
in Append. C. For the underdense cells, that cannot be 
called "halos", that is for m <C mc and typically m ~ mv, 
in many cases, there is another scaling law, similar to the 
one found for the count-in-ccUs (BaS89) as well as for 
the bias parameter b (BeS92). The vertices, in this case 
depend on mass only through the parameter z = m/rriv, 
with an overlapping regime for rriv <C m ^ rric. This case, 
its conditions of existence and its implications for the cell 
correlations, will be studied elsewhere. 

The sum over tree connections in all possible ways 
among the cells under consideration is then performed by 
writing, 



p{Nu...,Np) = 



exp 
with^ 



- ^p(iVi) C(iVi, 9i) + lp{Ni) 9i ^{Ni, 9i) 



89' 



(B47) 



(B48) 



Note that wc use the same notation 9 for the expression 
9{Ni,. . . ,Np) of this weight after integration over A (when 
the loops are not included) as for the weight ^(Ai, . . . , Ap) in 
equation (B31), although the two quantities are not the same. 



In case of the general tree-hierarchical model in (B48) 

has to be replaced by (,{V). 

We may similarly ask for the correlation among cells 
which contain more than N elementary particles, and ex- 
plore the effect of a threshold in the matter field. The same 
calculation shows that P, and ( are respectively replaced 

by 



p{> N,9) 
C(> N,9) 



27ri/ 



dA 



A^+i(A-l) 
p{> N,9)/p{> N,0). 



(B49) 
(B50) 



The expansion of C(> N,9) in powers of 9 defines the 
vertices yq{> N) that are to be used to construct the 
many-body correlation functions among cells containing 
more than N particles. 

Similarly, we may also affect a multiplicity N to each 
cell that contains N particles. The related statistics can 
also be described by tree correlations with the functions 



p(>iV,.) = ^/ 



dA 



A^+i(A- 1) dA 
a>N,9) = p{>N,9)/p{>N,Q). 



^ ex(A,0) 



(B51) 
(B52) 



The vertex generating functions obey the normalization 

conditions, 



Y,P{N)aN,9) = 1; 



N 

N 



^=p(iV)C(iV,0) = C(r = O,0). 



N 



(B53) 

(B54) 



In the first sum, the underdense cells are heavily weighted, 
with a negligible contribution of order l/^(w), for the same 
reasons as the ones discussed in Sect. A. 3 for the count- 
in-cells. The last, on the other hand, is heavily weighted 
towards the dense cells, the imderdense contribution van- 
ishing as a power of 1/^(11). These sum rules imply for the 
vertices uq{N), for Q > {i'q{N) = 1 by convention) 



J^PiN^N) = 0; 



JV 

N 



N 



(B55) 
(B56) 



B.3. Counts of halos 

Let us consider a volume V, made of finite elementary 
cells of finite volume v. We assume that, 



(B57) 



In the spirit of the previous section where we calcu- 
lated the correlation among dense cells, we consider now 
the counts of dense cells in the volume V, that is the 
probability to find a given number of "full" sub-cells in 
the volume V. To be precise we may decide a cell is "full" 
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if it contains more than N elementary particles (or exactly 
N..) and "empty" when the converse is true. The cells in 
the volume V may be labeled from i = 1 to i = P. We 
assume the threshold value N to be the same for all cells 
although a generalization with a mixture of thresholds is 
straightforward. 

The generating function for the counts of cells in vol- 
ume V, 



K 



(B58) 



can be defined from the probability P{K) of having K full 

cells in V . From the probability p{> N) for a cell to be 
full, we may define a number density of cells 



n(> N) = -p{> N). 

V 



(B59) 



Now, since we know that the many-body correlation func- 
tions, in the limit V v, are given by a tree-graph series 
with a vertex generating function ^(> N,6), we get the 
equation 

XV W = Y,{X-l)n{>Ni)viC{>Ni,ei) 

i 

-lY.(X-l)n{> Ni)vi9i^{> Ni,0i),{B6O) 



= ^(A - 1) n(> TV,) V, §(> N,,e,), (B61) 



d0' 



which are the same as (AS), (AlO), but written for finite 
cells and with a different vertex generating function C(> 
N,9) that replaces Q{t) of equation (A7). For the general 
tree-hierarchical models has to be replaced by £,iy). 

In case all the volumes of the elementary cells Vi arc 
equal, and all thresholds iVj are the same, we get the ana- 
logue of equations (A11-A17): 



Y 

e 

$(y) 

XV W 



{l-X)n{> N)V^{V); 
-^|(>iV,.); 

YC{>N,e)-\Ye^^{>N,e); 
-^{Y)/l{V). 



(B62) 
(B63) 

(B64) 
(B65) 



These equations together with the equations (B27), (B28) 
and (B49) allows to derive the generating function of the 
counts of halos in the volume V . Once xv{X) is known, one 
can get, by the inversion of the relation (B58), the values 
of P{K). In particular the void probability function in V 
(the absence of dense objects, not the absence of matter) 
can be directly obtained by -P(O) = exp[xy(0)]. 

We can note that in case the threshold is given hy N = 
we should recover the expression of the void probability 



function for the matter field in the volume V which reads, 
Y = nVl{V); (B66) 

^ = -^^W; (B67) 



^{Y) = Ym-lY9^{0y, 



(B68) 

P(0)) = e^^^^="^ = e-^{Y)/l{V). (B69) 

To check this we have to show that n{> 0)C(> 0, 6) = 
n({9). From (B49) we get 

n(> O)z;C(>O,0) = ~x(O,e). 
The latter, from (B27), (B28) is given by 



(B70) 

(B71) 
(B72) 



x{o,e) = -nvaT,e) + ^nvT^{T,e), 

T = -nvl{v)^{T,e). 

If 6 is finite, then Y must be finite too, and this is only 
possible when y is small (since v V) and whence r is 
small too. If 9 is large, then according to the equation 
(B72), T is necessarily small whatever the value of y: its 
value is dominated by the environment. When r is small 
C(t, 9) « C(0, 9) if the function ({t, 9) is regular enough, 
otherwise that would mean the successive mean field ap- 
proximations that we did arc not valid (because the de- 
pendence of the vertices is too strong). Except in these 
cases we see that we recover the right form for ^(> 0, 9). 

C. Scaling laws for dense cells 

In this section we are interested in cases where dense 
cells only are considered. It means that we assume that 
N ^ Ny so that we can apply the formula (B40). We fur- 
ther assume that the density of points is large enough so 
that the continuous limit can be used. It amounts to make 
the change of variable from A to y. One can then use the 
formulae (B35-B37) for x{X,0). 

C.l. Continuous limit 

The continuous limit is obtained by letting the number 

density n of points go to infinity, while keeping the mass 
density p finite. When this limit applies, the cell content 
is described by its mass m (see Sect. A. 3). Remaining in 
the dense cell regime, the results can be described with 
the variable x = rajm^, and we have. 



p(m,6») = 
C{m,9) = 



1 

~! 

h{x,9) 
h{x) 



+ 100 



^^^{y,e)e-y=^=h{x,9) 
C(x,0). 



The function C,{x, 9) generates by a series expansion, 
(-9)Q 



ax,9) = Y,M^)- 
Q 



(01) 
(02) 

(03) 
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the vertices i'p{x) that weight the vertices of the tree ex- 
pansion of the cells. The number density n{x) of a;-type 
cells is in this case 



n(x) = — h(x). 
rUc 



(C4) 



In case we define a "full" cell by a threshold x, h(x,9) 
and h{x) have to be replaced by, 



POO 

h{>x,e) = / h{x,e)dx 

J X 

J-\oo 27ri y 

h{> x) = / h{x) dx = - ^ 
Jx J-ioo 27ri 



27ri y 



(C5) 



(C6) 



and in case each cell is weighted by its content they have 
to be replaced by, 



h{> x,e) 



/ X h{x, 9) dx 

J X 

'■+'~ dy 1 difiy, 



h{> x) 



_ dy 1 

J-ioo 27ri y dy 

POO 

/ X h{x) dx 

J X 

+'°^ dy 1 d^(y) 



27ri y dy 



(C7) 



(C8) 



The function ({x,9), Eq. (C3), and the corresponding 
generating functions deduced from h{> x, 9) or h{> x, 9) 
defines a new set of many-body correlation functions. For 
instance, the vertex, 



vr{x)=^^{x,9) 



(C9) 



61=0 



or its analogues V\{> x), vi{> x) are simply the bias 
factors, b{x), b{> x), b{> x) respectively as presented in 
BeS92. 

The normalization condition (B54) implies 



/ 



xh{x)C{x,9)dx = C{9) (CIO) 
which is a generalization of the sum rule (BeS92), 
y xh{x)b{x)dx = 1, (Cll) 
that can be deduced from (CIO) by the expansion of 

ae) = i-9 + .... 

There is no counterpart to (B53) for which the domain 
N ^ is predominant. 



C.2. The renormalized vertices 

In case of a single set of objects, the observable quantities, 
such as the various moments of the density distribution, 
give information only on Vp{x)/bP{x) since the absolute 
strength of the matter correlation is not directly observ- 
able. 

We are then led to define 



ax,9) 



1-9 + 



62 (x) 

= ax,e/b[x]). 



vz{x) 9^ 
b^{x) 3! 



(C12) 



This function may be determined directly from the ob- 
served galaxy distribution through the void probability 
function. 

C.3. Asymptotic form for C(x, 9) 

The exact form of Q{x, 9) depends on the form taken by 
({r,9). There is however a limit which is independent of 
the exact form of ({r, 9) which corresponds to the case 
a; ^ 00. First of all we derive the expression of the bias. 



b{x) h{x) 



+'°° dv dC 

y^(T,6')e^^ 



27ri " 89 



T{y) = -y-^{T{y),0) 



(C13) 



and h(x) is given by the equation (C2). The behavior of 
b{x) for large values of x can be calculated by the steepest 
descent method, taking advantage of the singularity in the 
implicit equation for r(y). We are then led to define Tg and 
y, so that ^(r,) = 0, 

rs = g(r.,O)/0(r.,O), 

= -i/^(r.,o). 

Then we can derive the expression of the bias, 

'■+'°° dy 92 



(C14) 



Vs 



b{x)h{x) = £^ ^^^^ 
xKx) = /"^ 



xy 



2m dr 



{ts,0){t -Ts)e 



(r„0)(r-T,)e-^. (C15) 



As a result we get the behavior of b{x) when x is large, 

(C16) 



The form (C2) can be calculated with a similar method. 
The position of the singularity has to be determined as 
a function of 9. So we consider ys{9) and Ts{9) so that 



Tsi9) = -l(r„^)/-^(r„0) 



Or 



(C17) 
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We obviously have ys{9 = 0) = Us and Ts{6 = 0) = r^. 
Wc can then write the form of the generating function 

of the vertices, 

The resulting expression for C{x,9) is given by C{x,9) = 
C,{x,9/h{x)) and as h{x) is proportional to x, in the limit 
a; — > DO we have 9/b{x) <^ 1. It implies that 



ya{9) = y^+ 



(C19) 



which leads to 

C{x,e)^e-^ (C20) 

in the limit a; — > oo. This is a generalization of previ- 
ous results obtained by BeS92 where it was shown that 
Qdix) ^ 1 in such a limit. Here we obtain that all the 
vertices fQ(a;) 1. This result is valid for the minimal 
tree-hierarchical model as well as for the general case. In 
the latter case however we can only give the mean behav- 
ior of the many-body correlation functions of the halos as 
a function of scale and the geometrical dependence is not 
known. We thus rather have to define the averages of the 
halo correlation function in a volume V, 

-halo f d^ri d^Tp .halo/^ ^ ^ CPon 

?p = / -y- ■ ■ ■ -y- ?p (ri, • • • , rp) (G21) 

and the coefficient Sp{x) by 

—halo 

Spi^) = . }L p-1 ■ (C22) 



/-haloN ' 
[^2 ) 



These coefficient are mere combinations of the vertices 
£'q(x), and in the fimit a; — > oo we get that Sp{x) is simply 
the number of trees that can be constructed with p points, 
that is 

Spix) = P^-'^ when x ^ oo. (C23) 

The latter limit is valid for any tree-hierarchical models, 
provided some regularity conditions hold for the generat- 
ing function ({t, 9) so as the above expansions are justi- 
fied. 
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